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Prologue 


When people today hear the term “information revolution,’ they 
typically think of silicon chips and software code, cellular phones and 
supercomputers. They rarely think of tiny one-celled organisms or the 
rise of animal life. But, while writing these words in the summer of 
2012, I am sitting at the end of a narrow medieval street in Cambridge, 
England, where more than half a century ago a far-reaching information 
revolution began in biology. This revolution was launched by an 
unlikely but now immortalized pair of scientists, Francis Crick and 
James Watson. Since my time as a Ph.D. student at Cambridge during 
the late 1980s, I have been fascinated by the way their discovery 
transformed our understanding of the nature of life. Indeed, since the 
1950s, when Watson and Crick first illuminated the chemical structure 
and information-bearing properties of DNA, biologists have come to 
understand that living things, as much as high-tech devices, depend 
upon digital information—information that, in the case of life, is stored 
in a four-character chemical code embedded within the twisting figure 
of a double helix. 

Because of the importance of information to living things, it has now 
become apparent that many distinct “information revolutions” have 
occurred in the history of life—not revolutions of human discovery or 
invention, but revolutions involving dramatic increases in the 
information present within the living world itself. Scientists now know 
that building a living organism requires information, and building a 
fundamentally new form of life from a simpler form of life requires an 
immense amount of new information. Thus, wherever the fossil record 
testifies to the origin of a completely new form of animal life—a pulse 


of biological innovation—1t also testifies to a significant increase in the 
information content of the biosphere. 

In 2009, I wrote a book called Signature in the Cell about the first 
“information revolution” in the history of life—the one that occurred 
with the origin of the first life on earth. My book described how 
discoveries in molecular biology during the 1950s and 1960s 
established that DNA contains information in digital form, with its four 
chemical subunits (called nucleotide bases) functioning like letters in a 
written language or symbols in a computer code. And molecular biology 
also revealed that cells employ a complex information-processing 
system to access and express the information stored in DNA as they use 
that information to build the proteins and protein machines that they 
need to stay alive. Scientists attempting to explain the origin of life must 
explain how both information-rich molecules and the cell’s information- 
processing system arose. 

The type of information present in living cells—that is, “specified” 
information in which the sequence of characters matters to the function 
of the sequence as a whole—has generated an acute mystery. No 
undirected physical or chemical process has demonstrated the capacity 
to produce specified information starting “from purely physical or 
chemical” precursors. For this reason, chemical evolutionary theories 
have failed to solve the mystery of the origin of first life—a claim that 
few mainstream evolutionary theorists now dispute. 

In Signature in the Cell, I not only reported the well-known impasse 
in origin-of-life studies; I also made an affirmative case for the theory of 
intelligent design. Although we don’t know of a material cause that 
generates functioning digital code from physical or chemical precursors, 
we do know—based upon our uniform and repeated experience—of one 
type of cause that has demonstrated the power to produce this type of 
information. That cause is intelligence or mind. As information theorist 
Henry Quastler observed, “The creation of information is habitually 
associated with conscious activity.”! Whenever we find functional 
information—whether embedded in a radio signal, carved in a stone 
monument, etched on a magnetic disc, or produced by an origin-of-life 


scientist attempting to engineer a self-replicating molecule—and we 
trace that information back to its ultimate source, invariably we come to 
a mind, not merely a material process. For this reason, the discovery of 
digital information in even the simplest living cells indicates the prior 
activity of a designing intelligence at work in the origin of the first life. 

My book proved controversial, but in an unexpected way. Though I 
clearly stated that I was writing about the origin of the first life and 
about theories of chemical evolution that attempt to explain it from 
simpler preexisting chemicals, many critics responded as if I had written 
another book altogether. Indeed, few attempted to refute my book’s 
actual thesis that intelligent design provides the best explanation for the 
origin of the information necessary to produce the first life. Instead, 
most criticized the book as if it had presented a critique of the standard 
neo-Darwinian theories of biological evolution—theories that attempt to 
account for the origin of new forms of life from simpler preexisting 
forms of life. Thus, to refute my claim that no chemical evolutionary 
processes had demonstrated the power to explain the ultimate origin of 
information in the DNA (or RNA) necessary to produce life from 
simpler preexisting chemicals in the first place, many critics cited 
processes at work in already living organisms—in particular, the 
process of natural selection acting on random mutations in already 
existing sections of information-rich DNA. In other words, these critics 
cited an undirected process that acts on preexistent information-rich 
DNA to refute my argument about the failure of undirected material 
processes to produce information in DNA in the first place.? 

For example, the eminent evolutionary biologist Francisco Ayala 
attempted to refute Signature by arguing that evidence from the DNA of 
humans and lower primates showed that the genomes of these 
organisms had arisen as the result of an unguided, rather than 
intelligently designed, process—even though my book did not address 
the question of human evolution or attempt to explain the origin of the 
human genome, and even though the process to which Ayala alluded 
clearly presupposed the existence of another information-rich genome in 
some hypothetical lower primate.* 


Other discussions of the book cited the mammalian immune system 
as an example of the power of natural selection and mutation to generate 
new biological information, even though the mammalian immune 
system can only perform the marvels it does because its mammalian 
hosts are already alive, and even though the mammalian immune system 
depends upon an elaborately preprogrammed form of adaptive capacity 
rich in genetic information—one that arose long after the origin of the 
first life. Another critic steadfastly maintained that “Meyer’s main 
argument” concerns “the inability of random mutation and selection to 
add information to [preexisting] DNA’* and attempted to refute the 
book’s presumed critique of the neo-Darwinian mechanism of biological 
evolution accordingly. 

I found this all a bit surreal, as if I had wandered into a lost chapter 
from a Kafka novel. Signature in the Cell simply did not critique the 
theory of biological evolution, nor did it ask whether mutation and 
selection can add new information to preexisting information-rich 
DNA. To imply otherwise, as many of my critics did, was simply to 
erect a straw man. 

To those unfamiliar with the particular problems faced by scientists 
trying to explain the origin of life, it might not seem obvious why 
invoking natural selection does not help to explain the origin of the first 
life. After all, if natural selection and random mutations can generate 
new information in living organisms, why can it also not do so in a 
prebiotic environment? But the distinction between a biological and 
prebiotic context was crucially important to my argument. Natural 
selection assumes the existence of living organisms with a capacity to 
reproduce. Yet self-replication in all extant cells depends upon 
information-rich proteins and nucleic acids (DNA and RNA), and the 
origin of such information-rich molecules is precisely what origin-of- 
life research needs to explain. That’s why Theodosius Dobzhansky, one 
of the founders of the modern neo-Darwinian synthesis, can state flatly, 
“Pre-biological natural selection is a contradiction in terms.” Or, as 
Nobel Prize-winning molecular biologist and origin-of-life researcher 
Christian de Duve explains, theories of prebiotic natural selection fail 


because they “need information which implies they have to presuppose 
what is to be explained in the first place.””° Clearly, it is not sufficient to 
invoke a process that commences only once life has begun, or once 
biological information has arisen, to explain the origin of life or the 
origin of the information necessary to produce it. 

All this notwithstanding, I have long been aware of strong reasons 
for doubting that mutation and selection can add enough new 
information of the right kind to account for large-scale, or 
“macroevolutionary,” innovations—the various information revolutions 
that have occurred after the origin of life. For this reason, I have found it 
increasingly tedious to have to concede, if only for the sake of 
argument, the substance of claims I think likely to be false. 

And so the repeated prodding of my critics has paid off. Even though 
I did not write the book or make the argument that many of my critics 
critiqued in responding to Signature in the Cell, I have decided to write 
that book. And this is that book. 

Of course, it might have seemed a safer course to leave well enough 
alone. Many evolutionary biologists now grudgingly acknowledge that 
no chemical evolutionary theory has offered an adequate explanation of 
the origin of life or the ultimate origin of the information necessary to 
produce it. Why press a point you never made in the first place? 

Because despite the widespread impression to the contrary— 
conveyed by textbooks, the popular media, and spokespersons for 
official science—the orthodox neo-Darwinian theory of biological 
evolution has reached an impasse nearly as acute as the one faced by 
chemical evolutionary theory. Leading figures in several subdisciplines 
of biology—cell biology, developmental biology, molecular biology, 
paleontology, and even evolutionary biology—now openly criticize key 
tenets of the modern version of Darwinian theory in the peer-reviewed 
technical literature. Since 1980, when Harvard paleontologist Stephen 
Jay Gould declared that neo-Darwinism “is effectively dead, despite its 
persistence as textbook orthodoxy,”’ the weight of critical opinion in 
biology has grown steadily with each passing year. 


A steady stream of technical articles and books have cast new doubt 
on the creative power of the mutation and selection mechanism.* So 
well established are these doubts that prominent evolutionary theorists 
must now periodically assure the public, as biologist Douglas Futuyma 
has done, that “just because we don’t know how evolution occurred, 
does not justify doubt about whether it occurred.”? Some leading 
evolutionary biologists, particularly those associated with a group of 
scientists known as the “Altenberg 16,” are openly calling for a new 
theory of evolution because they doubt the creative power of the 
mutation and natural selection mechanism. !° 

The fundamental problem confronting neo-Darwinism, as with 
chemical evolutionary theory, is the problem of the origin of new 
biological information. Though neo-Darwinists often dismiss the 
problem of the origin of life as an isolated anomaly, leading 
theoreticians acknowledge that neo-Darwinism has also failed to explain 
the source of novel variation without which natural selection can do 
nothing—a problem equivalent to the problem of the origin of 
biological information. Indeed, the problem of the origin of information 
lies at the root of a host of other acknowledged problems in 
contemporary Darwinian theory—from the origin of new body plans to 
the origin of complex structures and systems such as wings, feathers, 
eyes, echolocation, blood clotting, molecular machines, the amniotic 
egg, skin, nervous systems, and multicellularity, to name just a few. 

At the same time, classical examples illustrating the prowess of 
natural selection and random mutations do not involve the creation of 
novel genetic information. Many biology texts tell, for example, about 
the famous finches in the Galapagos Islands, whose beaks have varied 
in shape and length over time. They also recall how moth populations in 
England darkened and then lightened in response to varying levels of 
industrial pollution. Such episodes are often presented as conclusive 
evidence for the power of evolution. And indeed they are, depending on 
how one defines “evolution.” That term has many meanings, and few 
biology textbooks distinguish between them. “Evolution” can refer to 
anything from trivial cyclical change within the limits of a preexisting 


gene pool to the creation of entirely novel genetic information and 
structure as the result of natural selection acting on random mutations. 
As a host of distinguished biologists have explained in recent technical 
papers, small-scale, or “microevolutionary,” change cannot be 
extrapolated to explain large-scale, or “macroevolutionary,” 
innovation.!! For the most part, microevolutionary changes (such as 
variation in color or shape) merely utilize or express existing genetic 
information, while the macroevolutionary change necessary to assemble 
new organs or whole body plans requires the creation of entirely new 
information. As an increasing number of evolutionary biologists have 
noted, natural selection explains “only the survival of the fittest, not the 
arrival of the fittest.”!* The technical literature in biology is now replete 
with world-class biologists!* routinely expressing doubts about various 
aspects of neo-Darwinian theory, and especially about its central tenet, 
namely, the alleged creative power of the natural selection and mutation 
mechanism. 

Nevertheless, popular defenses of the theory continue apace, rarely if 
ever acknowledging the growing body of critical scientific opinion 
about the standing of the theory. Rarely has there been such a great 
disparity between the popular perception of a theory and its actual 
standing in the relevant peer-reviewed scientific literature. Today 
modern neo-Darwinism seems to enjoy almost universal acclaim among 
science journalists and bloggers, biology textbook writers, and other 
popular spokespersons for science as the great unifying theory of all 
biology. High-school and college textbooks present its tenets without 
qualification and do not acknowledge the existence of any significant 
scientific criticism of it. At the same time, official scientific 
organizations—such as the National Academy of Sciences (NAS), the 
American Association for the Advancement of Sciences (AAAS), and 
the National Association of Biology Teachers (NABT)—routinely 
assure the public that the contemporary version of Darwinian theory 
enjoys unequivocal support among qualified scientists and that the 
evidence of biology overwhelmingly supports the theory. For example, 
in 2006 the AAAS declared, “There is no significant controversy within 


the scientific community about the validity of the theory of 
evolution.”!* The media dutifully echo these pronouncements. As New 
York Times science writer Cornelia Dean asserted in 2007, “There is no 
credible scientific challenge to the theory of evolution as an explanation 
for the complexity and diversity of life on earth.”!> 

The extent of the disparity between popular representations of the 
status of the theory and its actual status, as indicated in the peer- 
reviewed technical journals, came home to me with particular 
poignancy as I was preparing to testify before the Texas State Board of 
Education in 2009. At the time the board was considering the adoption 
of a provision in its science education standards that would encourage 
teachers to inform students of both the strengths and weaknesses of 
scientific theories. This provision had become a political hot potato after 
several groups asserted that “teaching strengths and weaknesses” were 
code words for biblical creationism or for removing the teaching of the 
theory of evolution from the curriculum. Nevertheless, after defenders 
of the provision insisted that it neither sanctioned teaching creationism 
nor censored evolutionary theory, opponents of the provision shifted 
their ground. They attacked the provision by insisting that there was no 
need to consider weaknesses in modern evolutionary theory because, as 
Eugenie Scott, spokeswoman for the National Center for Science 
Education, insisted in The Dallas Morning News, “There are no 
weaknesses in the theory of evolution.”!® 

At the same time, I was preparing a binder of one hundred peer- 
reviewed scientific articles in which biologists described significant 
problems with the theory—a binder later presented to the board during 
my testimony. So I knew—unequivocally—that Dr. Scott was 
misrepresenting the status of scientific opinion about the theory in the 
relevant scientific literature. I also knew that her attempts to prevent 
students from hearing about significant problems with evolutionary 
theory would have likely made Charles Darwin himself uncomfortable. 
In On the Origin of Species, Darwin openly acknowledged important 
weaknesses in his theory and professed his own doubts about key 
aspects of it. Yet today’s public defenders of a Darwin-only science 


curriculum apparently do not want these, or any other scientific doubts 
about contemporary Darwinian theory, reported to students. 

This book addresses Darwin’s most significant doubt and what has 
become of it. It examines an event during a remote period of geological 
history in which numerous animal forms appear to have arisen suddenly 
and without evolutionary precursors in the fossil record, a mysterious 
event commonly referred to as the “Cambrian explosion.” As he 
acknowledged in the Origin, Darwin viewed this event as a troubling 
anomaly—one that he hoped future fossil discoveries would eventually 
eliminate. 

The book is divided into three main parts. Part One, “The Mystery of 
the Missing Fossils,” describes the problem that first generated 
Darwin’s doubt—the missing ancestors of the Cambrian animals in the 
earlier Precambrian fossil record—and then tells the story of the 
successive, but unsuccessful, attempts that biologists and 
paleontologists have made to resolve that mystery. 

Part Two, “How to Build an Animal,” explains why the discovery of 
the importance of information to living systems has made the mystery 
of the Cambrian explosion more acute. Biologists now know that the 
Cambrian explosion not only represents an explosion of new animal 
form and structure but also an explosion of information—that it was, 
indeed, one of the most significant “information revolutions” in the 
history of life. Part Two examines the problem of explaining how the 
unguided mechanism of natural selection and random mutations could 
have produced the biological information necessary to build the 
Cambrian animal forms. This group of chapters explains why so many 
leading biologists now doubt the creative power of the neo-Darwinian 
mechanism and it presents four rigorous critiques of the mechanism 
based on recent biological research. 

Part Three, “After Darwin, What?” evaluates more current 
evolutionary theories to see if any of them explain the origin of form 
and information more satisfactorily than standard neo-Darwinism does. 
Part Three also presents and assesses the theory of intelligent design as 
a possible solution to the Cambrian mystery. A concluding chapter 


discusses the implications of the debate about design in biology for the 
larger philosophical questions that animate human existence. As the 
story of the book unfolds, it will become apparent that a seemingly 
isolated anomaly that Darwin acknowledged almost in passing has 
grown to become illustrative of a fundamental problem for all of 
evolutionary biology: the problem of the origin of biological form and 
information. 

To see where that problem came from and why it has generated a 
crisis in evolutionary biology, we need to begin at the beginning: with 
Darwin’s own doubt, with the fossil evidence that elicited it, and with a 
clash between a pair of celebrated Victorian naturalists—the famed 
Harvard paleontologist Louis Agassiz and Charles Darwin himself. 


Part One 


The Mystery of the Missing Fossils 
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Darwin’s Nemesis 


When Charles Darwin finished his famous book, he thought that he had 
explained every clue but one. 

By anyone’s measure, On the Origin of Species was a singular 
achievement. Like a great Gothic cathedral, the ambitious work 
integrated many disparate elements into a grand synthesis, explaining 
phenomena in fields as diverse as comparative anatomy, paleontology, 
embryology, and biogeography. At the same time, it was impressive for 
its simplicity. Darwin’s Origin explained many classes of biological 
evidence with just two central organizing ideas. The twin pillars of his 
theory were the ideas of universal common ancestry and natural 
selection. 

The first of these pillars, universal common ancestry, represented 
Darwin’s theory of the history of life. It asserted that all forms of life 
have ultimately descended from a single common ancestor somewhere 
in the distant past. In a famous passage at the end of the Origin, Darwin 
argued that “all the organic beings which have ever lived on this earth 
have descended from some one primordial form.”! Darwin thought that 
this primordial form gradually developed into new forms of life, which 
in turn gradually developed into other forms of life, eventually 
producing, after many millions of generations, all the complex life we 
see in the present. 

Biology textbooks today usually depict this idea just as Darwin did, 
with a great branching tree. The trunk of Darwin’s tree of life represents 
the first primordial organism. The limbs and branches of the tree 
represent the many new forms of life that developed from it (see Fig. 


1.1). The vertical axis on which the tree is plotted represents the arrow 
of time. The horizontal axis represents changes in biological form, or 
what biologists call “morphological distance.” 
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FIGURE 1.1 
Darwin’s evolutionary tree of life, as depicted by the nineteenth-century 
German evolutionary biologist Ernst Haeckel. 


Biologists often call Darwin’s theory of the history of life “universal 
common descent” to indicate that every organism on earth arose from a 
single common ancestor by a process of “descent with modification.” 
Darwin argued that this idea best explained a variety of biological 
evidences: the succession of fossil forms, the geographical distribution 
of various species (such as Galapagos finches), and the anatomical and 
embryological similarities among otherwise highly distinct organisms. 

The second pillar of Darwin’s theory affirmed the creative power of a 
process he called natural selection, a process that acted on random 
variations in the traits or features of organisms and their offspring. 
Whereas the theory of universal common descent postulated a pattern 
(the branching tree) to represent the history of life, Darwin’s idea of 
natural selection referred to a process that he said could generate the 
change implied by his branching tree of life. 

Darwin formulated the idea of natural selection by analogy to a well- 
known process, that of “artificial selection” or “selective breeding.” 
Anyone in the nineteenth century familiar with the breeding of domestic 
animals—dogs, horses, sheep, or pigeons, for example—knew that 
human breeders could alter the features of domestic stock by allowing 
only animals with certain traits to breed. A sheepherder from the north 
of Scotland might breed for a woollier sheep to enhance its chances of 
survival in a cold northern climate (or to harvest more wool). To do so, 
he would choose only the woolliest males and woolliest ewes to breed. If 
generation after generation he continued to select and breed only the 
woolliest sheep among the resulting offspring, he would eventually 
produce a woollier breed of sheep. In such cases, “the key is man’s 
power of accumulative selection,” wrote Darwin. “Nature gives 
successive variations; man adds them up in certain directions useful to 
him.” 


Darwin noted that pigeons have been coaxed into a dizzying variety 
of breeds: the carrier, with its elongated eyelids and a “wide gape of 
mouth”; the “short-faced tumbler,” with its “beak in outline almost like 
that of a finch”; the common tumbler, with its penchant for flying in 
close formation and “tumbling in the air head over heels”; and, perhaps 
strangest of all, the pouter, with its elongated legs, wings, and body 
overshadowed by its “enormously developed crop, which it glories in 
inflating” for its astonished patrons.* 

Of course, pigeon breeders achieved these startling metamorphoses 
by carefully sifting and selecting. But, as Darwin pointed out, nature 
also has a means of sifting: defective creatures are less likely to survive 
and reproduce, while those offspring with beneficial variations are more 
likely to survive, reproduce, and pass on their advantages to future 
generations. In the Origin, Darwin argued that this process, natural 
selection acting on random variations, could alter the features of 
organisms just as intelligent selection by human breeders can. Nature 
itself could play the role of the breeder. 

Consider once more our flock of sheep. Imagine that instead of a 
human selecting the woolliest males and ewes to breed, a series of very 
cold winters ensures that all but the very woolliest sheep in a population 
die. Now again only very woolly sheep will remain to breed. If the cold 
winters continue over several generations, will the result not be the same 
as before? Won’t the population of sheep eventually become discernibly 
woollier? 

This was Darwin’s great insight. Nature—in the form of 
environmental changes or other factors—could have the same effect on a 
population of organisms as the intentional decisions of an intelligent 
agent. Nature would favor the preservation of certain features over 
others—specifically, those that conferred a functional or survival 
advantage upon the organisms possessing them—causing the features of 
the population to change. And the resulting change will have been 
produced not by an intelligent breeder choosing a desirable trait or 
variation—not by “artificial selection’—but by a wholly natural process. 
What’s more, Darwin concluded that this process of natural selection 





acting on randomly arising variations had been “the chief agent of 
change” in generating the great branching tree of life in all its variety. 

On the Origin of Species seized the attention of the scientific 
community like a thunderclap. Darwin’s analogy to artificial selection 
was powerful, his proposed mechanism of natural selection and random 
variation easily grasped, and his skill in dispensing with potential 
objections unrivalled. Moreover, the explanatory scope of his argument 
for universal common descent constituted something of a tour de force. 
By the close of the Origin, it seemed to many that Darwin had dispensed 
with every conceivable objection to his theory but one. 


THE ANOMALY: DARWIN’S DOUBT 


Despite the scope of his synthesis, there was one set of facts that 
troubled Darwin—something he conceded his theory couldn’t 
adequately explain, at least at present. Darwin was puzzled by a pattern 
in the fossil record that seemed to document the geologically sudden 
appearance of animal life in a remote period of geologic history, a period 
that at first was commonly called the Silurian, but later came to be 
known as the Cambrian. 

During this geological period, many new and anatomically 
sophisticated creatures appeared suddenly in the sedimentary layers of 
the geologic column without any evidence of simpler ancestral forms in 
the earlier layers below, in an event that paleontologists today call the 
Cambrian explosion. Darwin frankly described his concerns about this 
conundrum in the Origin: “The difficulty of understanding the absence 
of vast piles of fossiliferous strata, which on my theory were no doubt 
somewhere accumulated before the Silurian [i.e., Cambrian] epoch, is 
very great,” he wrote. “I allude to the manner in which numbers of 
species of the same group suddenly appear in the lowest known 
fossiliferous rocks.”° The sudden appearance of animals so early in the 
fossil record did not easily accord with Darwin’s new theory of gradual 
evolutionary change, and there was one scientist who would not let him 
forget it. 


THE ANTAGONIST 


Swiss-born paleontologist Louis Agassiz, of Harvard University, was 
one of the best-trained scientists of his age, and he knew the fossil 
record better than any man alive. Hoping to enlist Agassiz as an ally, 
Darwin sent him a copy of On the Origin of Species and asked him to 
consider the argument with an open mind (see Fig. 1.2). One can almost 
see the great, aging naturalist receiving the unremarkable package from 
the postman, unwrapping the small green volume that had stirred such a 
tempest on both sides of the Atlantic. Perhaps he retired to his study the 
better to concentrate, scrutinizing the book’s prepossessing title, 
recalling what he had already heard about the work. He read the book 
with deep interest, making notes in the margin as he moved through it, 
but in the end his verdict would disappoint its author. Agassiz concluded 
that the fossil record, particularly the record of the explosion of 
Cambrian animal life, posed an insuperable difficulty for Darwin’s 
theory. 





FIGURE 1.2 
Figure 1.2a (left): Louis Agassiz. Figure 1.2b (right): Charles Darwin. 


THE TWO-PRONGED CHALLENGE 


To see why, consider brachiopods and trilobites, two of the best- 
documented creatures in the Cambrian fossil record by 1859. The 


brachiopod (see Fig. 1.3), with its two shells, looks like a clam or an 
oyster, but is very different inside. As shown in the accompanying 
figure, it possesses a gonad, mantle, mantle cavity, anterior body wall, 
body cavity, gut, and lophophore, the last of which is a feeding organ 
like a ring of tentacles, usually in the shape of a coil or horseshoe, with a 
mouth inside the ring of tentacles, and an anus outside. The brachiopod 
exhibits a highly complex overall body plan, with many individually 
complex and functionally integrated anatomical systems and parts. Its 
tentacles, for instance, are covered by cilia precisely arranged to 
generate and direct a current of water toward the mouth.° 
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FIGURE 1.3 
Figure 1.3a (top): Brachiopod internal anatomy. Figure 1.3b (bottom, 
left): Brachiopod fossil showing remains of internal structure. Courtesy 
Paul Chien. Figure 1.3c (bottom, right): Fossil showing exterior 
structure of brachiopod shell. Courtesy Corbis. 
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FIGURE 1.4 
Figure 1.4a (top): Trilobite anatomy. Figure 1.4b (bottom): Trilobite 
fossil of the species Kuanyangia pustulosa. Courtesy Illustra Media. 


Even more sophisticated was the trilobite (see Fig. 1.4), with its three 
longitudinal lobes across its head (a raised middle lobe and a flatter 
pleural lobe to either side) and a body divided into three parts—head, 
chest, and tail, the former two consisting of as many as thirty segments. 
It had a pair of legs for every pleural groove and another three pairs for 
the head. Most dramatic of all were the compound eyes found on even 
some of the very early trilobites—eyes that afforded these not so 
primitive animals a 360-degree field of vision.’ 

The abrupt appearance of such complex anatomical designs presented 
a challenge to each of the two main parts of Darwin’s theory of 
evolution. 


THE CAMBRIAN EXPLOSION AND THE ACTION OF 
NATURAL SELECTION 


The Cambrian fossil evidence represented a significant challenge to 
Darwin’s claim that natural selection had the capacity to produce novel 
forms of life. As Darwin described it, the ability of natural selection to 
produce significant biological change depends upon the presence of 
three distinct elements: (1) randomly arising variations, (2) the 
heritability of those variations, and (3) a competition for survival, 
resulting in differences in reproductive success among competing 
organisms. 

According to Darwin, variations in traits arise randomly. Some 
variations (such as thicker fleece) might confer advantages in the 
competition for survival in particular environmental conditions. Those 
variations that are heritable and that impart functional or survival 
advantage will be preserved in the next generation. As nature “selects” 
these successful variations, the features of a population change. 


Darwin conceded that the beneficial variations responsible for 
permanent change in species are both rare and necessarily modest. Major 
variations in forms, what later evolutionary biologists would term 
“macromutations,” inevitably produce deformity and death. Only minor 
variations meet the test of viability and heritability. 

It followed that, over human timescales, the benefits of this 
evolutionary mechanism would be difficult or impossible to spot. But 
given enough time, favorable variations would gradually accumulate 
and give rise to new species and, given more time, even fundamentally 
new groups of organisms and body designs. If artificial selection could 
conjure so many strange breeds from a wild strain in a few centuries, 
Darwin argued, imagine what natural selection could achieve over many 
millions of years. Even the origin of complex structures such as the 
mammalian eye—which seemed at first to present a significant 
challenge to his theory—could be explained if one postulated the 
existence of an initially simpler structure (such as a light sensitive spot) 
that could be gradually modified over long periods of time. 

And that was the rub. Darwin’s mechanism of natural selection and 
random variation necessarily required a lot of time to generate wholly 
novel organisms, creating a dilemma that Agassiz was keen to expose. 

In an 1874 Atlantic Monthly essay titled “Evolution and the 
Permanence of Type,” Agassiz explained his reasons for doubting the 
creative power of natural selection. Small-scale variations, he argued, 
had never produced a “specific difference” (1.e., a difference in species). 
Meanwhile, large-scale variations, whether achieved gradually or 
suddenly, inevitably resulted in sterility or death. As he put it, “It is a 
matter of fact that extreme variations finally degenerate or become 
sterile; like monstrosities they die out.” 

Darwin himself insisted that the process of evolutionary change he 
envisioned must occur very gradually for the same reason. Thus, Darwin 
realized that building, for instance, a trilobite from single-celled 
organisms by natural selection operating on small, step-by-step 
variations would require countless transitional forms and failed 
biological experiments over vast stretches of geologic time. As 


University of Washington paleontologist Peter Ward would later explain, 
Darwin had very specific expectations for what paleontologists would 
find below the lowest known strata of animal fossils—in particular, 
“intervening strata showing fossils of increasing complexity until finally 
trilobites appeared.”” As Darwin noted, “If my theory be true, it is 
indisputable that before the lowest Silurian [Cambrian] stratum was 
deposited, long periods elapsed, as long as, or probably far longer than, 
the whole interval from the Silurian age to the present day; and that 
during these vast, yet quite unknown, periods of time, the world 
swarmed with living creatures.” 1° 

The mechanism of natural selection necessarily had to work 
gradually on small incremental variations. And, indeed, the kinds of 
variations that Darwin actually observed and described in developing his 
analogy between natural and artificial selection were in every case 
minor. Only by selecting and accumulating minor variations over many 
generations were breeders able to produce the striking changes in the 
features of a breed, changes that were, nevertheless, extraordinarily 
modest compared to the radical differences in form between, say, 
Precambrian and Cambrian forms of life. At the end of the day, as 
Agassiz hastened to note, the pigeons Darwin cited in support of the 
creative power of artificial and, by analogy, natural selection were still 
pigeons. More significant changes to the form and anatomical structure 
of organisms would, by the logic of Darwin’s mechanism, require untold 
millions of years, precisely what seemed unavailable in the case of the 
Cambrian explosion. 


THE CAMBRIAN EXPLOSION AND THE TREE OF LIFE 


The abrupt appearance of the Cambrian fauna also posed a separate but 
related difficulty for Darwin’s picture of a continuously branching tree 
of life. To produce truly novel animal forms, the Darwinian mechanism 
would—by its own internal logic—require not only millions of years, 
but untold generations of ancestors. Thus, even the discovery of a 
handful of plausible intermediates allegedly linking a Precambrian 


ancestor to a Cambrian descendant wouldn’t come close to fully 
documenting Darwin’s picture of the history of life. If Darwin is right, 
Agassiz argued, then we should find not just one or a few missing links, 
but innumerable links shading almost imperceptibly from alleged 
ancestors to presumed descendants. Geologists, however, had found no 
such myriad of transitional forms leading to the Cambrian fauna. 
Instead, the stratigraphic column seemed to document the abrupt 
appearance of the earliest animals. 

Agassiz thought the evidence of abrupt appearance, and the absence 
of ancestral forms in the Precambrian, refuted Darwin’s theory.!! Of 
these earlier forms, Agassiz asked, “Where are their fossilized remains?” 
He insisted that Darwin’s picture of the history of life “contradict[ ed] 
what the animal forms buried in the rocky strata of our earth tell us of 
their own introduction and succession upon the surface of the globe. Let 
us therefore hear them;—for, after all, their testimony is that of the eye- 
witness and the actor in the scene.” !? 


MURCHISON, SEDGWICK, AND THE CAMBRIAN FOSSILS 
OF WALES 


Darwin, for his part, responded with more than civility. Far from 
dismissing Agassiz, he conceded that his objection carried considerable 
force. Nor was Agassiz alone in pressing these concerns. Other leading 
naturalists thought the fossil evidence presented a significant obstacle to 
Darwin’s theory. At the time, perhaps the best place to investigate the 
lowest known strata of fossils was Wales, and one of its leading experts 
was Roderick Impey Murchison, who named the earliest geologic period 
the Silurian after an ancient Welsh tribe. Five years before On the Origin 
of Species, he called attention to the sudden appearance of complex 
designs like the compound eyes of the first trilobites, creatures already 
thriving at the apparent dawn of animal life. For him, this discovery 
ruled out the idea that these creatures had evolved gradually from some 
primitive and relatively simple form: “The earliest signs of living things, 
announcing as they do a high complexity of organization, entirely 


exclude the hypothesis of a transmutation from lower to higher grades of 
being.” !? 

The other pioneering explorer of Wales’s rich fossil record, Adam 
Sedgwick, also thought that Darwin had leaped beyond the evidence, as 
he told him in a letter in the fall of 1859: “You have deserted—after a 
start in that tram-road of all solid physical truth—the true method of 
induction.” '* Sedgwick might have had in mind the same evidence the 
two men had studied together some twenty-eight years before when the 
Cambridge professor had brought Darwin along as his field assistant to 
explore, in the Upper Swansea Valley in northwestern Wales, the very 
strata that seemed to testify so powerfully to the sudden appearance of 
animal life. It was these strata that Sedgwick named after a Latinized 
English term for the country of Wales—“Cambria,” a designation that 
eventually replaced “Silurian” as the name for the earliest strata of 
animal fossils. 
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FIGURE 1.5 
Three organisms that first appear in the Ordivician period: eurypterans 
(sea scorpions), starfish, and tetracoral. 


Sedgwick emphasized that these Cambrian animal fossils appeared to 
pop out of nowhere into the geological column. But he also stressed 
what he viewed as a broader reason to doubt Darwin’s evolutionary 
model: the sudden appearance of the Cambrian animals was merely the 
most outstanding instance of a pattern of discontinuity that extends 
throughout the geologic column. Where in the Ordovician strata, for 
instance, are many of the families of the trilobites and brachiopods 
present in the Cambrian just below it?!> These creatures along with 
numerous other types suddenly disappear. But just as suddenly one finds 
newcomers in the Ordovician strata like the eurypterans (sea scorpions), 
starfish, and tetracorals (see Fig. 1.5).!° In a later Paleozoic period called 
the Devonian, the first amphibians (e.g., Ichthyostega) arise. Much later, 
many staples of the Paleozoic era (which encompasses the Cambrian, 
Ordovician, and four subsequent periods) suddenly go extinct in a period 
called the Permian.'’ Then, in the Triassic period that follows, 
completely novel animals such as turtles and dinosaurs emerge.!* Such 
discontinuity, Sedgwick argued, is not the exception, but the rule. 


DATING BY DISCONTINUITY 


Already by Sedgwick’s time, the various strata of fossils had proved so 
distinct one from another that geologists had come to use the sharp 
discontinuities between them as a key means for dating rocks. 
Originally, the best tool for determining the relative age of various strata 
was based on the notion of superposition. Put simply, unless there is a 
reason to believe otherwise, a geologist provisionally assumes that lower 
rocks were put down before the rocks above them. Now, contrary to a 
widespread caricature, no respected geologist, then or now, adopts this 
method uncritically. The most basic training in geology teaches that rock 
formations can be twisted, upended, even mixed pell-mell by a variety of 
phenomena. This is why geologists have always looked for other means 
to estimate the relative age of different strata. 

In 1815, Englishman William Smith had hit upon just such an 
alternative means.!? While studying the distinct fossil strata exposed 


during canal construction, Smith noted that so dissimilar are the fossil 
types among different major periods and so sharp and sudden the break 
between them, that geologists could use this as one method for 
determining the relative age of strata. Even when layers of geological 
strata are twisted and turned, the clear discontinuities between the 
various strata often allow geologists to discern the order in which they 
were deposited, particularly when there is a broad enough sampling of 
rich geological sites from the period under investigation to study and 
cross-reference. Although not without its pitfalls, this approach has 
become a standard dating technique, used in conjunction with 
superposition and other more recent radiometric dating methods." 










EPOCHS DURATION 
(NORTH AMERICA) (Iw MILLTONS OF YEARS) 


LTERNATE — 


PERIODS gaia: odg 





ERES 


QUARTERNARY | HOLOCENE EPOCH 
Penrod [Eaa o = 
NEOGENE PLEISTOCENE EPOCH 
PERICD PLIOCENE EPOCH Y 
CENOZOIC MICCENE EPOCH 
ERA TERTIARY 63.4 
PERIOD | OLIGOCENE EPOCH MY. 















PALEOGENE EOCENE EPOCH 
ius 


CRETACEOUS PERIOD | 
MESOZOIC]  JURASTC PERIOD 





TRIASSIC PERIOD 


PALEOZOIC 
ERA 








CAMBRIAN PERIOD 
SZ 530 MILLION YEARS AGO 


NECPROC- 
TEROZCIC 























ERA C2635 MY BGO => 
@RECAMBRIAN{ OTHER PRECAMBRIAN APPROX. 
TIME) PERIODS 4,000 MY. 


FIGURE 1.6 
The geological timescale. 


Indeed, it’s difficult to overemphasize how central the approach is to 
modern historical geology. As Harvard paleontologist Stephen Jay 
Gould explains, it is the phenomenon of fossil succession that dictates 
the names of the major periods in the geological column (see Fig. 1.6). 
“We might take the history of modern multi-cellular life, about 600 
million years, and divide this time into even and arbitrary units easily 
remembered as 1—12 or A-L, at 50 million years per unit,” Gould writes. 
“But the earth scorns our simplifications, and becomes much more 
interesting in its derision. The history of life is not a continuum of 
development, but a record punctuated by brief, sometimes geologically 
instantaneous, episodes of mass extinction and subsequent 
diversification.””! The question that Darwin’s early critics posed was 
this: How could he reconcile his theory of gradual evolution with a fossil 
record so discontinuous that it had given rise to the names of the major 
distinct periods of geological time, particularly when the first animal 
forms seemed to spring into existence during the Cambrian as if from 
nowhere? 


A SOLUTION UNSEEN 


Of course, Darwin was well aware of these problems. As he noted in the 
Origin, “The abrupt manner in which whole groups of species suddenly 
appear in certain formations has been urged by several paleontologists— 
for instance, by Agassiz, Pictet, and Sedgwick—as a fatal objection to 
the belief in the transmutation of species. If numerous species, belonging 
to the same genera or families, have really started into life all at once, 
the fact would be fatal to the theory of descent with slow modification 
through natural selection.”?* Darwin, however, proposed a possible 
solution. He suggested that the fossil record may be significantly 
incomplete: either the ancestral forms of the Cambrian animals were not 
fossilized or they hadn’t been found yet. “I look at the natural geological 


record, as a history of the world imperfectly kept, and written in a 
changing dialect,” Darwin wrote. “Of this history we possess the last 
volume alone, relating only to two or three countries. Of this volume, 
only here and there a short chapter has been preserved; and of each page, 
only here and there a few lines. . . . On this view, the difficulties above 
discussed are greatly diminished, or even disappear.” 

Darwin himself was less than satisfied with this explanation.” 
Agassiz, for his part, would have none of it. “Both with Darwin and his 
followers, a great part of the argument is purely negative,” he wrote. 
They “thus throw off the responsibility of proof. . .. However broken the 
geological record may be, there is a complete sequence in many parts of 
it, from which the character of the succession may be ascertained.” On 
what basis did he make this claim? “Since the most exquisitely delicate 
structures, as well as embryonic phases of growth of the most perishable 
nature, have been preserved from very early deposits, we have no right 
to infer the disappearance of types because their absence disproves some 
favorite [i.e., Darwinian] theory.”° 

Though Darwin himself was less than enthusiastic about his response 
to Agassiz’s objection, it seemed adequate to satisfy the needs of the 
moment. The overwhelming preponderance of evidence that Darwin had 
marshaled seemed to support his theory. In any case, many leading 
naturalists—Joseph Hooker, Thomas Huxley, Ernst Haeckel, and Asa 
Gray—all younger than Agassiz, quickly aligned themselves with his 
evolutionary line of thinking. True, some scientists, notably the Scottish 
engineering professor Fleeming Jenkin and (later) the English geneticist 
William Bateson, expressed persistent doubts about the efficacy of 
natural selection. But despite the views of some weighty scientific 
critics, Darwin’s revolutionary theory won increasingly wide support 
and soon defined the terms of the debate about the history of life. Those 
who rejected it wholesale, as Agassiz did, consigned themselves to 
increasing irrelevance. 
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AGASSIZ UNDER THE MICROSCOPE 


So did Agassiz identify a genuine problem for Darwin’s theory, a 
mystery, at least, waiting to be solved? If so, whatever became of this 
problem? And if not, how could such a brilliant and knowledgeable 
scientist, someone so steeped in the evidence, fall so far outside the 
mainstream of scientific opinion? 

Historians of science in the post-Darwinian era have typically 
attempted to answer this later question by portraying Agassiz as a 
brilliant and respected scientist who nevertheless was too ossified to 
catch the new wave, a figure past his prime and mired in philosophical 
prejudice.*° Biographer Edward Lurie describes the Harvard naturalist 
as a “giant of the nineteenth century . . . a person deeply involved in his 
surroundings, a man who understood the possibilities of life with an 
uncommon awareness.””’ Similarly, historian Mabel Robinson says that 
she long awaited a biography of Agassiz that “would re-create this man 
of genius and his headlong splendid race through life.” He was, she said, 
“a man to remember because genius is rare,” “an immortal Pied Piper.”?° 
These scholars are merely echoing what Agassiz’s contemporaries, even 
Darwin himself, said. “What a set of men you have at Harvard! ” Darwin 
told the American poet Henry Wadsworth Longfellow. “Both our 
universities put together cannot furnish the like. Why, there is Agassiz— 
he counts for three.”?? 

Even so, many historians argue that Agassiz was too infected by 
German idealism to properly assess the factual basis of Darwin’s case. 
According to idealist philosophers of biology, living forms exemplified 
transcendent ideas and in their organization provided evidence of 
purposive design in nature. Comments historian A. Hunter Dupree, 
“Agassiz’s idealism was of course the basis of his concepts of species 
and their distribution,” of his insistence that a divine or intellectual cause 
must stand behind the origin of each type.?? The ship of science was 
transitioning from idealism to modern empiricism. Agassiz had fallen 
overboard, since he had imbibed too deeply an outmoded idealism from 
his teacher, the French anatomist Georges Cuvier, and from philosophers 
like Friedrich Schelling, who “ran wild in trying to put all nature into a 
unified and absolute system of ideas.”>! 
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Agassiz wasn’t merely wrong, 


Dupree explains, but an annoying obscurantist, actively fighting “against 
the extension of empiricism into natural history.”>? 

Edward Lurie offers a similar if somewhat more nuanced assessment: 
although “quite capable of making the most admirable scientific 
discoveries reflecting complete devotion to scientific method,” Agassiz 
“would then interpret the data through the medium of what seemed to be 
the most absurd metaphysics.’”*? The very man who made “the most 
careful, exact, and precise descriptions” of the natural world would, in 
his generalizations from those observations, “indulge in flights of 
idealistic fancy.”°+ In short, Lurie thought that “Agassiz’s cosmic 
philosophy shaped his entire reaction to the evolution idea.” 

As science advanced in the late nineteenth century, it increasingly 
excluded appeals to divine action or divine ideas as a way of explaining 
phenomena in the natural world. This practice came to be codified in a 
principle known as methodological naturalism. According to this 
principle, scientists should accept as a working assumption that all 
features of the natural world can be explained by material causes without 
recourse to purposive intelligence, mind, or conscious agency. 

Proponents of methodological naturalism argue that science has been 
so successful precisely because it has assiduously avoided invoking 
creative intelligence and, instead, searched out strictly material causes 
for previously mysterious features of the natural world. In the 1840s, the 
French philosopher August Comte argued that science progresses 
through three distinct phases. In its theological phase, it invokes the 
mysterious action of the gods to explain natural phenomena, whether 
thunderbolts or the spread of disease. In a second, more advanced 
metaphysical stage, scientific explanations refer to abstract concepts like 
Plato’s forms or Aristotle’s final causes. Comte taught that science only 
reaches maturity when it casts aside such abstractions and explains 
natural phenomena by reference to natural laws or strictly material 
causes or processes. Only in this third and final stage, he argued, can 
science achieve “positive” knowledge. 

During the late nineteenth century, scientists increasingly embraced 
this “positivistic” vision.’ Agassiz, by insisting that the Cambrian 


fossils pointed to “acts of mind”*’ and an “intervention of an intellectual 
power,” stood firmly against this new vision. For many, his reference to 
the work of a transcendent mind merely demonstrated that he was unable 
to abandon an outmoded idealistic approach. The train of scientific 
progress had left Agassiz behind. 


AN OLD FOSSIL RECOVERED 


Though clearly Agassiz did reject the principle of methodological 
naturalism, as it is now named, there are problems with portraying him 
as a fossil of another age. First, Agassiz was unsurpassed in his 
commitment to the empirical method. It is Agassiz about whom the story 
is told of the professor instructing one of his students to observe a fish 
for three arduous days, a story iconic enough that it is reprinted in 
freshman composition textbooks. In the story, the student, Samuel 
Scudder, pulls out his hair trying to see anything new about the slimy 
creature, wondering why Professor Agassiz is torturing him with this 
“hideous fish.” But in the end Scudder breaks through to new levels of 
observational depth and precision. Mabel Robinson notes that if such 
teaching methods seem less revolutionary to contemporary readers than 
they did to Scudder, that’s because Agassiz trained an army of able 
young naturalists who took his method to other universities, and they in 
turn passed them on to their students, themselves future professors.*® 

William James, the founder of American pragmatism, extolled 
Agassiz’s commitment to empirical rigor in a letter he wrote to his father 
while on an expedition with Agassiz in 1865 to South America. In the 
letter the young man commented that he felt a “greater feeling of weight 
and solidity about the presence of this great background of special facts 
than about the mind of any other man I know,”?? a storehouse of precise 
data made possible by “a rapidity of observation, and a capacity to 
recognize them again and remember everything about them.” James 
would eventually enter the field of psychology, but he took with him the 
empirical approach to problem solving that Agassiz had modeled so 
impressively.*! 


As Lurie concedes, Agassiz’s stature among American scientists 
grew out of his unrivaled knowledge of geology, paleontology, 
ichthyology, comparative anatomy, and taxonomy. So passionate was 
Agassiz for the particulars of the natural world that he began organizing 
a system of information-sharing among naturalists, sailors, and 
missionaries around the world. He collected more than 435 barrels of 
specimens, among them an extremely rare group of fossil plants.? In a 
single year, Agassiz amassed more than 91,000 specimens and identified 
close to 11,000 new species,** making Harvard’s natural history museum 
preeminent among such museums in the world. 

He also appears to have gone to great lengths, literally and 
figuratively, to assess On the Origin of Species empirically, going so far 
as to make a research voyage retracing Darwin’s trip to the Galapagos 
Islands. As he explained to German zoologist Carl Gegenbauer, he 
“wanted to study the Darwin theory free from all external influences and 
former prejudices.”4+ 

The idea that religious or philosophical prejudice compromised 
Agassiz’s scientific judgment raises other questions. As historian Neal 
Gillespie explains, Agassiz was “second to no man in his opposition to 
sectarian religious interference with science.”4> Moreover, Agassiz 
showed himself perfectly willing to accept natural mechanisms where 
before supernatural intervention had been the preferred explanation. 
Since he regarded material forces, and the laws of nature that described 
them, as the products of an underlying design plan, he saw any creative 
work they did as deriving ultimately from a creator. For instance, he 
assumed this was the case with the development of embryos: he 
attributed their natural evolution from zygote to adult as a natural 
phenomenon and considered this no threat to his belief in a creator.*° He 
also readily accepted the notion of a naturally evolving solar system.*/ 
He thought a skillful cosmic architect could work through secondary 
natural causes every bit as effectively as through direct acts of agency. 
The marginalia in his copy of On the Origin of Species suggest that he 
had this same attitude concerning biological evolution. “What is the 


great difference,” he wrote, “between supposing that God makes 
variable species or that he makes laws by which species vary?” 

A third problem with the official portrait of Darwin’s chief rival 
concerns Lurie’s suggestion that Agassiz was a master of particulars, but 
not of generalizing from those particulars. The historical record suggests 
otherwise. For example, Agassiz was the man who ably generalized 
from a wide array of particular clues in his work on the ice age, winning 
over the geological establishment by demonstrating how a range of facts 
were best explained by the action of retreating glaciers. 

Here a direct comparison between Darwin and Agassiz is possible. 
Each searched for an explanation of a curious geological phenomenon in 
the Scottish Highlands, the parallel roads of Glen Roy. Glen Roy is the 
valley of the River Roy and, although it’s a place of breathtaking beauty, 
what visitors found most intriguing about it over the years were its three 
parallel roads that wind along the canyon wall on either side of the river 
(see Fig. 1.7).48 Scottish legend held that they were hunting paths built 
for use by early Scottish kings or perhaps even for the mythical warrior 
Fingal. Scientists later argued that the roads were natural rather than 
artificial. Darwin and Agassiz were both convinced that natural 
processes were the cause, but they nevertheless arrived at different 
explanations. What was the end of the matter? In his autobiography, 
Darwin explained, “Having been deeply impressed with what I had seen 
of the elevation of the land in S. America, I attributed the parallel lines 
to the action of the sea; but I had to give up this view when Agassiz 
propounded his glacier-lake theory.’“? Subsequent investigations in the 
late nineteenth and early twentieth centuries confirmed that Agassiz’s 
interpretation was the correct one.°° 

Agassiz, then, was far more than just a walking encyclopedia or an 
insatiable gatherer of fossils who couldn’t see the proverbial forest for 
the trees. Those who insist otherwise can point to but one example to 
support their position, namely, his rejection of Darwin’s theory; but they 
cannot use that example to establish his general inability to interpret 
evidence and then turn around and use that supposed inability to explain 
his failure to accept Darwin’s theory. That is to argue in a circle. 


There is a far more obvious solution to the historical puzzle posed by 
the great Agassiz’s objection to Darwin’s theory: the fossils of the 
Cambrian strata do, in fact, arise abruptly in the geological record, in 
clear defiance of what Darwin’s theory would lead us to expect. In short, 
a genuine mystery is at hand. 





FIGURE 1.7 
Parallel roads of Glen Roy. 


Two final considerations lend support to this view. First, as already 
noted, Darwin himself accepted the validity of Agassiz’s objection.*! As 
he acknowledged elsewhere in the Origin, “To the question why we do 
not find rich fossiliferous deposits belonging to these assumed earliest 
periods prior to the Cambrian system, I can give no satisfactory answer. 
. . . The case at present must remain inexplicable; and may be truly 
urged as a valid argument against the views here entertained.” 

Second, Darwin’s attempt to account for the absence of the expected 
fossil ancestors of the Cambrian forms failed to address the full strength 
and subtlety of Agassiz’s objection. As Agassiz explained, the problem 
with Darwin’s theory was not just the general incompleteness of the 


fossil record or even a pervasive absence of ancestral forms of life in the 
fossil record. Rather, the problem, according to Agassiz, was the 
selective incompleteness of the fossil record. 

Why, he asked, does the fossil record always happen to be incomplete 
at the nodes connecting major branches of Darwin’s tree of life, but 
rarely—in the parlance of modern paleontology—at the “terminal 
branches” representing the major already known groups of organisms? 
These terminal branches were well represented (see Fig. 1.8), often 
stretching over many generations and millions of years, while the 
“internal branches” at the connecting nodes on Darwin’s tree of life were 
nearly always—and selectively—absent. As Agassiz explained, 
Darwin’s theory “rests partly upon the assumption that, in the succession 
of ages, just those transition types have dropped out from the geological 
record which would have proved the Darwinian conclusions had these 
types been preserved.”5? To Agassiz, it sounded like a just-so story, one 
that explains away the absence of evidence rather than genuinely 
explaining the evidence we have. 

Was there any easy answer to Agassiz’s argument? If so, beyond his 
stated willingness to wait for future fossil discoveries, Darwin didn’t 
offer one. 





FIGURE 1.8 

The vertical lines in these diagrams represent known animal phyla. The 
dots within the vertical lines represent animals from those phyla that 
have been found fossilized in different strata. The diagram on the left 
shows the animal tree of life as expected based upon Darwinian theory. 


The diagram on the right shows a simplified representation of the actual 
pattern of the Precambrian—Cambrian fossil record. Notice that fossils 
representing the internal branches and nodes, but not the terminal 
branches, are missing. 


AN ENDURING MYSTERY 


In the years immediately following the publication of On the Origin of 
Species, many of Agassiz’s concerns were temporarily swept aside as 
public and scientific fascination with Darwin’s ideas grew. Even so, a 
persistent mystery lay at the feet of biologists, one that subsequent 
generations of scientists would revisit and repeatedly seek to resolve. As 
Darwin noted, in his time, the fossils of the Cambrian were relatively 
few, and the period of the explosion only vaguely understood. But 
perhaps future scientists would come to his rescue with fresh 
discoveries. 

The story of the successive attempts to solve the Cambrian mystery 
stretches from Darwin’s time to the present, and from the Swansea 
Valley in southern Wales to remote fossil sites in southern China. In the 
next chapter, the work of detection moves from the late nineteenth 
century to the early twentieth, from the British Isles to British Columbia, 
and to a fossil site above the Kicking Horse River so astonishing that, 
even today, paleontologists and some of the most skeptical and hardened 
of scientific rationalists speak its name with childlike reverence. 


2 
The Burgess Bestiary 


Only in fiction can we expect such fine orchestration of setting and 
dramatic action. Gothic tales haunted by demons of the past have their 
thunderstorms and crumbling mansions, the existentialist novel its 
disorienting cityscapes, the romance its unattainable balconies laced 
with jasmine. In ordinary life, the staging is usually less precise. 
Intricate family tragedies unravel in tidy suburban ranch-style homes, 
while enchanted romances blossom over cubicle walls. But the twentieth 
century’s most revolutionary fossil discovery was more like fiction: the 
setting was commensurate with the moment. 

Photographs taken during the summer’s expedition show a lean and 
balding man with pleasant crinkles at the corners of his eyes and a deep 
thought line slashing down between his brows; he stands precariously on 
rocky ascents, pick and hoe at hand, gazing far into the distance from a 
stony peak, at ease among the forbidding slopes and treacherous ridges. 
Working his way over one ridge and then above the tree line of the next, 
Charles Doolittle Walcott reached a place where he could see for miles. 
To the northwest, the crude arrowhead of Mount Wapta jutted skyward. 
Below lay Emerald Lake, its waters green from the mineral-rich glacial 
till. To the east and west, snowy peaks stretched to the horizon (see Figs. 
2.1 and 2.2). Only the view to the northeast lacked a vista. Here was the 
homely shale of a barren ridge. Of course, as in any fairy tale, there lay 
the real prize, a hidden vista measured not in miles, but in ages. 
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FIGURE 2.1 
The scenery of the Burgess Shale and surrounding area. Courtesy 
Corbis. 





FIGURE 2.2 
Charles Doolittle Walcott in the field (c. 1911). Courtesy Smithsonian 
Institution Archives. 


Walcott, already the director of the Smithsonian Institution, was 
about to enter the most significant phase of his professional life. More 
than this, he was about to make perhaps the most dramatic discovery in 
the history of paleontology, a rich trove of middle Cambrian-era fossils, 
including many previously unknown animal forms, preserved in 
exquisite detail, suggesting an event of greater suddenness than had been 
known even in Darwin’s time and detailing a greater diversity of 
biological form and architecture than had hitherto been imagined. 

Where did this wealth of biological form come from, and why, again, 
did it seem to arise so suddenly during the Cambrian period? Walcott 
was the first to explore the Burgess Shale, and he would be the first to 
suggest an answer to the questions it raised. 


THE BESTIARY 


Among paleontologists, the fateful clue that led to the Burgess Shale’s 
discovery is the stuff of legend. Paleontologist Stephen Jay Gould 
considered it to have been rendered best in an obituary of Charles 
Walcott written by Walcott’s former research assistant, Charles 
Schuchert: 


One of the most striking of Walcott s faunal discoveries came at 
the end of the field season of 1909, when Mrs. Walcott’ horse slid 
on going down the trail and turned up a slab that at once attracted 
her husband 5 attention. Here was a great treasure—wholly 
strange Crustacea of Middle Cambrian time—but where in the 
mountain was the mother rock from which the slab had come? 
Snow was even then falling, and the solving of the riddle had to be 
left to another season, but next year the Walcotts were back again 
on Mount Wapta, and eventually the slab was traced to a layer of 


shale—later called the Burgess Shale—3000 feet above the town 
of Field.” 


Gould quotes the legend to celebrate its archetypal appeal even as he 
debunks it: “Consider the primal character of this tale—the lucky break 
provided by the slipping horse, . . . the greatest discovery at the very last 
minute of a field season (with falling snow and darkness heightening the 
drama of finality), the anxious wait through a winter of discontent, the 
triumphant return and careful, methodical tracing of errant block to 
mother lode.”* A compelling story, Gould concludes, but pure fiction. 
Walcott’s own diaries reveal that his team had plenty of time to begin 
excavating the site that very summer amid cooperative weather and even 
warm nights. As for their return the following summer, locating the 
mother lode was apparently the work of a single day rather than a full 
week, a conclusion Gould drew from both Walcott’s diaries and his 
knowledge of Walcott’s expertise as a geologist. 

The motifs of the lucky break, the frustrating delay, and the final and 
fortuitous triumph will resurface later (see Chapter 7) as a tall tale of 
Gould’s own, but for now consider only the scientific community’s 
weakness for staging the Burgess discovery with various fictional props, 
as if the stunning scenery around it were not setting enough. This 
weakness for theater is understandable, considering what Walcott and 
later investigators found there. Over the next several years, Walcott’s 
team alone collected more than 65,000 specimens, many of them 
astonishingly well preserved, some so bizarre that paleontologists would 
cast about for more than half a century for the proper categories in which 
to contain them. 

Consider just one odd couple from Walcott’s quarry, Marrella and 
Hallucigenia. Marrella, also called a lace crab, is an unusual form. 
Walcott described it as a type of trilobite, but later studies by Cambridge 
paleontologist Harry Whittington classified it not as a trilobite, nor a 
chelicerate (the subgroup of arthropods that includes spiders), and not 
even as a crustacean, but rather as a fundamentally distinct form of 
arthropod.* The creature is divided into twenty-six segments, each with 


a jointed leg for walking and a feathery gill branch for swimming. Its 
head shield has two long pairs of spikes directed backwards, and the 
underside of the head features two pairs of antennae. One is short and 
stout, the other long and sweeping (see Fig. 2.3). 

Hallucigenia belongs to a genus and family of one. It has a rounded 
mass at one end (possibly the head) connected to a cylinder-shaped trunk 
sporting seven pairs of spines projecting upward and to either side, each 
of them almost as long as the trunk itself (see Fig. 2.4). On the underside 
of the creature are seven pairs of limbs, each corresponding in position 
to one of the pairs of spines on the back, though with the tentacle 
farthest back offset. The underbelly also features three pairs of shorter 
tentacles before the trunk tapers and curves upward in what was 
probably a flexible extension from the body. Each of the larger tentacles 
appears to have a hollow tube connected to the gut and a pincer at the 
tip. This ancient creature was so peculiar that paleontologists feigned 
disbelief at what they saw, giving it its memorable name. 





FIGURE 2.3 

Figure 2.3a (left): Artist rendering of Marrella splendens. Figure 2.3b 
(right): Photograph of Marrella splendens fossil. Courtesy Wikimedia 
Commons, user Smith609. 





FIGURE 2.4 

Figure 2.4a (top): Artist rendering of Hallucigenia sparsa. Figure 2.4b 
(bottom): Photograph of AHallucigenia sparsa fossil. Courtesy 
Smithsonian Institution. 


The term “Cambrian explosion” was to become common coin, 
because Walcott’s site suggested the geologically abrupt appearance of a 
menagerie of animals as various as any found in the gaudiest science 
fiction. During this explosion of fauna, representatives of about twenty 
of the roughly twenty-six total phyla present in the known fossil record 
made their first appearance on earth (see Fig. 2.5).° 

The term “phyla” (singular: “phylum”) refers to divisions in the 
biological classification system. The phyla constitute the highest (or 
widest) categories of biological classification in the animal kingdom, 
with each exhibiting a unique architecture, organizational blueprint, or 
structural body plan. Familiar examples of phyla are cnidarians (corals 
and jellyfish), mollusks (squid and clams), echinoderms (sea stars and 
sea urchins), arthropods (trilobites and insects), and the chordates, to 
which all vertebrates including humans belong. 

The animals within each phylum exhibit distinguishing features that 
enable taxonomists to divide and group them further into other, 
progressively smaller divisions, beginning with classes and orders, and 
eventually coming to families, genera, and individual species. The 
broadest and highest categories within the animal kingdom—such as 
phyla and classes—designate the major categories of animal life, 
typically designating unique body plans. Lower taxonomic categories— 
like genus and species—designate smaller degrees of difference among 
organisms that typically exemplify similar overall ways of organizing 
their body parts and structures. 

Throughout the book I will use these conventional categories of 
classification, as do most Cambrian paleontologists. Nevertheless, I am 
aware that some paleontologists and systematists (experts in 
classification) today prefer “phylogenetic classification,” a method that 
often uses a “rank-free” classification scheme. Advocates of modern 


phylogenetic classification argue that the traditional classification 
system lacks objective criteria by which to decide whether a certain 
group of organisms should be assigned a particular rank of, for example, 
phylum or class or order.’ Proponents of rank-free classification attempt 
to eliminate subjectivity in classification (and ranking) by grouping 
together animals that are thought, based upon studies of similar 
molecules in different groups, to share a common ancestor. This method 
of classification treats groups that emerge at roughly the same time on 
the tree of life as equivalent. Nevertheless, even proponents of 
phylogenetic classification often use the conventional taxonomic 
categories in their technical discussions of specific organisms because of 
their common scientific usage. So despite my own sympathy with some 
of the concerns of rank-free advocates (see below), I have chosen to do 
the same. 
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FIGURE 2.5 

Figure 2.5a (top): Chart showing when representatives of the different 
animal phyla first appeared in the fossil record. According to Darwinian 
theory, differences in biological form should increase gradually, steadily 
increasing the number of distinct body plans and phyla, over time. 
References for first appearances are found in note 5 of this chapter. 
Figure 2.5b (bottom, left) expresses that expectation graphically, 
showing the number of new phyla increasing steadily as members of one 
phylum diversify and give rise to new phyla. Figure 2.5c (bottom, right) 
shows the actual pattern of first appearance showing a spike in the 
number of phyla that first appear in the Cambrian, followed by either 
few or no new phyla arising in subsequent periods of geological history. 


In any case, it’s worth noting that using a rank-free classification 
system does not minimize the mystery of the Cambrian explosion. The 
Cambrian explosion presents a puzzle for evolutionary biologists, not 
just because of the number of phyla that arise, but rather because of the 
number of unique animal forms and structures that arise (as measured, 
perhaps, by the number of phyla)—however biologists decide to classify 
them. Thus, whether scientists decide to use newer rank-free 
classification schemes or older, more conventional, Linnaean categories, 
the “evolutionary novelties’”—that is, the new anatomical structures and 
modes of organization—that arise suddenly with the Cambrian animals 
remain as facts of the fossil record, requiring explanation. (For an 
expanded technical discussion of these issues, go to this endnote.) 

One especially dramatic fact of the Cambrian explosion is the first 
appearance of many novel marine invertebrate animals (representatives 
of separate invertebrate’ phyla, subphyla, and classes in the traditional 
classification scheme). Some of these animals have mineralized 
exoskeletons, including those representing phyla, such as echinoderms, 
brachiopods, and arthropods, and each represent clearly distinct and 
novel body plans. Further, these are just three of dozens of novel body 
plans exemplified by the Burgess animals—animals in which both soft 
and hard parts are well preserved (see Fig. 2.6). 
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FIGURE 2.6 
Representatives of some of the major animal groups that first appeared 
in the sedimentary rock record during the Cambrian period. 


The variety in the Burgess Shale was so extreme it took several 
decades for paleontologists to grasp it fully. Walcott, for instance, 
attempted to fit all of the new forms into existing phyla. However, even 
in the midst of this attempt, he realized that this revolutionary quarry 
posed a problem more fundamental than a need to tidy up the existing 
taxonomy. He had met Louis Agassiz at a young age, having sold him 
some of his first fossils, and later described him as “a guide in whom I 
could trust and follow,” one in whose work “I find this tribute to the 
Great Mind that created the objects of his study.”!° But in the great 
debate between Agassiz and Darwin, Walcott sided with the 
Englishman. Thus, the Burgess Shale struck Walcott as not only 
fascinating, but puzzling. 


A PUZZLING PATTERN 


Over the years, as paleontologists have reflected on the overall pattern of 
the Precambrian—Cambrian fossil record in light of Walcott’s 
discoveries, they too have noted several features of the Cambrian 


explosion that are unexpected from a Darwinian point of view!! in 


particular: (1) the sudden appearance of Cambrian animal forms; (2) an 
absence of transitional intermediate fossils connecting the Cambrian 
animals to simpler Precambrian forms; (3) a startling array of 
completely novel animal forms with novel body plans; and (4) a pattern 
in which radical differences in form in the fossil record arise before 
more minor, small-scale diversification and variations. This pattern turns 
on its head the Darwinian expectation of small incremental change only 
gradually resulting in larger and larger differences in form. 


THE MISSING TREE 


Figures 2.7 and 2.8 illustrate the difficulty posed by the first two of these 
features: sudden appearance and missing intermediates. These diagrams 
graph morphological change over time. The first shows the Darwinian 
expectation that changes in morphology should arise only as tiny 
changes accumulate. This Darwinian commitment to gradual change 
through microevolutionary variations produces the classic representation 
of evolutionary history as a branching tree. 

Now compare this branching tree pattern with the pattern in the fossil 
record. The bottom part of Figure 2.7 and Figure 2.8 show that the 
Precambrian strata do not document the expected transitional 
intermediates between Cambrian and Precambrian fauna. Instead, the 
Precambrian—Cambrian fossil record, especially in light of the Burgess 
Shale after Walcott, points to the geologically sudden appearance of 
complex and novel body plans. 
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FIGURE 2.7 

The origin of animals. Darwinian theory (top) predicts gradual 
evolutionary change in contrast to the fossil evidence (bottom), which 
shows the abrupt appearance of the major animal groups. 


panes ga 


IDEALIZED 
q REPRESENTATION OF 
FOSSIL EVIDENCE 


i wy 
ANN i j | i 








TIME 


Ly 
7N 
(l 
—N 
> 
Ky 
W 
H 
Po 
Ee 
o 
2 
o0. 
P 
0000 
o2 
e0ce 


oO. 


<—  NEC-DARWINIAN 
Ig. %8 Se? THEORY 


(e2 
00009 
Oo 
009 


coo 





MORPHOLOGY —————————— > 


FIGURE 2.8 

According to Darwinian theory, the strata beneath the Cambrian rocks 
should evidence many ancestral and intermediate forms. Such forms 
have not been found for the vast majority of animal phyla. These 
anticipated but missing forms are represented by the gray circles. Lines 
and dark circles depict fossilized representatives of phyla that have 
been found. 


Of course, the fossil record does show an overall increase in the 
complexity of organisms from Precambrian to Cambrian times, as 
Darwin expected. But the problem posed by the Burgess Shale is not the 
increase in complexity, but the sudden quantum leap in complexity. The 
jump from the simpler Precambrian organisms (further explored in the 
next chapters) to the radically different Cambrian forms appears to occur 
far too suddenly to be readily explained by the gradual activity of natural 
selection and random variations. Neither the Burgess Shale nor any other 
series of sedimentary strata known in Walcott’s day recorded a pattern of 
novel body plans arising gradually from a sequence of intermediates. 


Instead, completely unique organisms such as the bizarre arthropod 
Opabinia (see Fig. 2.9)—with its fifteen articulated body segments, 
twenty-eight gills, thirty flipper-like swimming lobes, long trunk-like 
proboscis, intricate nervous system, and five separate eyes!*—appear 
fully formed in the Cambrian strata along with representatives of other 
fundamentally different body plans and designs of equal complexity. 

Darwin, as we know, regarded the sudden appearance of the 
Cambrian animals as a significant challenge to his theory.!* Where 
natural selection had to bridge yawning chasms from relatively simple 
life-forms to exquisitely complex creatures, it would require great 
expanses of time.!* 








FIGURE 2.9 
Figure 2.9a (top): Artist rendering of Opabinia. Figure 2.9b (bottom): 
Photograph of Opabinia fossil. 


Darwin’s recognition!> of this constraint was prescient. Geologists in 
his day employed relative dating methods. They did not have modern 
radiometric methods for determining the “absolute” ages of rocks. For 
this reason, they did not yet fully understand how long it would have 
taken to accumulate the great columns of sedimentary rock and, thus, the 


great expanses of time that were available to the evolutionary process. 
Neither had scientists yet discovered the sophisticated inner workings of 
the cell, and the information-rich structures (DNA, RNA, and proteins) 
that had to be significantly altered to achieve even modest evolutionary 
changes. Nevertheless, Darwin was able—based upon what he knew of 
the complexity of organisms and his own understanding of how the 
mechanism of natural selection must operate—to deduce that descent 
with modification required time, and lots of it. 

Recalling the context of Darwin’s original argument reveals why. In 
the Origin, he sought to counter the famous watch-to-watchmaker 
design argument offered by theologian William Paley. Paley had argued 
that just as complex structures such as watches necessarily issue from 
intelligent watchmakers, the complex structures in living organisms 
must likewise owe their origin to a designing intelligence. With natural 
selection, Darwin proposed a purely natural mechanism for constructing 
the complex organs and structures (such as eyes) present in many forms 
of life. His mechanism of natural selection worked by constructing such 
systems one tiny step at a time, discarding the harmful variations and 
seizing upon the rare improvement. If evolution progressed by “whole 
watches’”—that is, by entire anatomical systems like the trilobite’s eye— 
then biology would have fallen back to the old absurdity of imagining 
that a watch could fall together purely at random and all at once. Thus, 
unless Darwin’s evolutionary mechanism worked gradually by 
preserving the tiniest of random changes over many millions of years, it 
didn’t work at all. 


MORE MISSING LINKS 


Two other features of the Cambrian explosion revealed in the Burgess 
Shale, features (3) and (4) described earlier, not only confirmed the 
reality of the Cambrian mystery, but broadened and deepened it—and at 
just the time when paleontologists were looking to resolve the mystery 
with new fossil discoveries. 


First, the great profusion of completely novel forms of life in the 
Burgess assemblage (feature 3) demanded even more transitional forms 
than had previously been thought missing. Each new and exotic 
Cambrian creature—the anomalocarids (see Fig. 2.10), Marrella, 
Opabinia, and the bizarre and appropriately named Hallucigenia—for 
which there were again no obvious ancestral forms in the lower strata, 
required its own series of transitional ancestors. But where were they? 





Figure 2.10a (top): Artist rendering of Anomalocaris. Figure 2.10b 
(bottom): Photograph of Anomalocaris fossil. Courtesy J. Y. Chen. 


Darwin had hoped that later fossil discoveries would eventually 
eliminate what he regarded as the one outstanding anomaly associated 
with his theory. Walcott’s discovery was not that discovery. Not only did 
the Burgess Shale fail to reveal the expected ancestral precursors of the 
known Cambrian animal forms, but it revealed a motley crew of 
previously unknown animal forms and body plans that now demanded 
their own lengthy chain of evolutionary precursors, only complicating 
the task of explaining the Cambrian explosion in Darwinian terms. 


ORDERS FROM THE TOP 


The Burgess Shale raised an additional difficulty (feature 4, discussed 
earlier), though not one that Walcott recognized during his lifetime. 
Instead, its exposition would await a later generation of Cambrian 
experts, particularly Stephen Jay Gould. Darwin’s theory implied that as 
new animal forms first began to emerge from a common ancestor, they 
would at first be quite similar to each other, and that large differences in 
the forms of life—what paleontologists call disparity—would only 
emerge much later as the result of the accumulation of many incremental 
changes. In its technical sense, disparity refers to the major differences 
in form that separate the higher-level taxonomic categories such as 
phyla, classes, and orders. In contrast, the term diversity refers to minor 
differences among organisms classified as different genera or species. 
Put another way, disparity refers to life’s basic themes; diversity refers to 
the variations on those themes. The more body plans in a fossil 
assembly, the greater the disparity. And the animal forms preserved in 
the Burgess Shale display considerable disparity. Further, the large 
differences in form between the first animals appeared suddenly in the 
Burgess Shale, and the appearance of such disparity arose before, not 
after, the diversification of many representatives of lower taxonomic 
categories (such as species or genera) within each higher category, 
designating a new body plan. 

The site of the Burgess Shale and its setting nicely illustrates the 
difference between diversity and disparity. Walcott’s celebrated quarry is 
tucked away in the Canadian Rockies near the Continental Divide. 
Reaching it involves a six-mile hike through the picturesque scenery of 
Yoho National Park—Takakkaw Falls, Emerald Lake, and glaciers and 
glacier-cut mountain peaks thrusting into view at almost every turn. In 
this ecologically diverse setting, hikers have a chance of spotting 
squirrels, marmots, deer, moose, elk, wolves, and mountain goats. Rare 
sightings might include a grizzly bear or Canadian lynx, while alert 
birdwatchers might glimpse a horned lark, a white-tailed ptarmigan, the 
rare water pipit, or a gray-crowned rosy finch; an eagle, hawk, or 
grassland falcon; dippers, jays, migrating warblers, or harlequin ducks.!° 


As richly various as this array of animals is, all of them come from a 
single phylum, Chordata—and even from a single subphylum, 
Vertebrata. Imagine hiking to the quarry to excavate it and, on the hike, 
being lucky enough to spot every one of these animals along the way. 
After having feasted your eyes on such animal variety, when you arrive 
at Walcott’s quarry, it yields not merely dozens of fossilized species 
from a single subphylum, but wildly disparate creatures from dozens of 
phyla. 

According to Darwin’s theory, the differences in form, or 
“morphological distance,” between evolving organisms should increase 
gradually over time as small-scale variations accumulate by natural 
selection to produce increasingly complex forms and structures 
(including, eventually, new body plans). In other words, one would 
expect small-scale differences or diversity among species to precede 
large-scale morphological disparity among phyla. As the former Oxford 
University neo-Darwinian biologist Richard Dawkins puts it, “What had 
been distinct species within one genus become, in the fullness of time, 
distinct genera within one family. Later, families will be found to have 
diverged to the point where taxonomists (specialists in classification) 
prefer to call them orders, then classes, then phyla.”!7 

Darwin himself made this point in On the Origin of Species. 
Explaining his famous tree diagram (see Fig. 2.1la), he noted that it 
illustrated more than just the theory of universal common descent. The 
tree diagram also illustrated how higher taxa should emerge from lower 
taxa by the accumulation of numerous slight variations. He said, “The 
diagram illustrates the steps by which small differences distinguishing 
varieties are increased into larger differences distinguishing species.” !® 
He went on to assert that the process of modification by natural selection 
would eventually move beyond the formation of species and genera to 
form “two distinct families, or orders, according to the amount of 
divergent modification supposed to be represented in the diagram.”!° In 
his view, this process would continue until it produced differences in 
form that were great enough that taxonomists would classify them as 
new classes or phyla. In short, diversity would precede disparity, and 


phyla-level differences in body plan would emerge only after species-, 
genus-, family-, order-, and class-level differences appeared. 

The actual pattern in the fossil record, however, contradicts this 
expectation (compare Fig. 2.12 to Fig. 2.11b). Instead of more and more 
species eventually leading to more genera, leading to more families, 
orders, classes, and phyla, the fossil record shows representatives of 
separate phyla appearing first followed by lower-level diversification on 
those basic themes. 

This is nowhere more dramatically apparent than in the Cambrian 
period explains Roger Lewin in the journal Science: “Several possible 
patterns exist for the establishment of higher taxa, the two most obvious 
of which are the bottom-up and the top-down approaches. In the first, 
evolutionary novelties emerge, bit by bit. The Cambrian explosion 
appears to conform to the second pattern, the top-down effect.” Or as 
paleontologists Douglas Erwin, James Valentine, and Jack Sepkoski note 
in their study of skeletonized marine invertebrates: “The fossil record 
suggests that the major pulse of diversification of phyla occurs before 
that of classes, classes before that of orders, orders before that of 
families. .. . The higher taxa do not seem to have diverged through an 
accumulation of lower taxa.””! In other words, instead of a proliferation 
of species and other representatives of lower-level taxa occurring first 
and then building to the disparity of higher taxa, the highest taxonomic 
differences such as those between phyla and classes appear first 
(instantiated by relatively few species-level representatives). Only later, 
in more recent strata, does the fossil record document a proliferation of 
representatives of lower taxa: different orders, families, genera, and so 
on. Yet we would not expect the neo-Darwinian mechanism of natural 
selection acting on random genetic mutations to produce the top-down 
pattern that we observe in the history of life following the Cambrian 
explosion. 























FAMILY #1 FAMILY #2 


GENUS | GENUS GENUS | GENUS 
GENUS GENUS GENUS #1 #2 #3 t4 
#1 s #3 


N OD DOP OO 


SPECIES. 





FIGURE 2.11 

Figure 2.1la (top): Darwin’s theory of common descent illustrated here 
with his famous branching tree of life diagram reproduced from the 
Origin of Species, 1859. Figure 2.11b (bottom): Growth of the tree of 
life over time in the manner envisioned by Darwin with new species 
giving rise to new genera and families, eventually giving rise to new 
orders, classes, and phyla (these higher taxonomic categories not 
depicted). 


Of course, advocates of modern phylogenetic classification, with 
their “rank-free” approach, don’t describe this phenomenon as a “top- 
down” pattern, because their system of classification dispenses with 
taxonomic ranks and hierarchies. In their system, there is no “top” and 
no “down.” Nevertheless, advocates of phylogenetic classification do 
acknowledge that different combinations of “character” states 
(characteristics or features of organisms) can mark either bigger or 
smaller morphological differences between clades (closely related 
groups of organisms that presumably share a common ancestor). And 
some leading advocates of phylogenetic classification have noted that 
the fossil record exhibits a pattern in which a few character traits 
marking large morphological differences between clades arise first 
followed ater in each clade by the addition of other combinations of 
characters that mark smaller differences within those clades. Larger 
differences between clades arise first, smaller differences within them 
arose later—the themes precede the variations. 
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FIGURE 2.12 
The top-down pattern of appearance in the fossil record: disparity 
precedes diversity. 


The founder of the modern phylogenetic classification, Willi Hennig, 
for example, noted that once particular groups arise, the range of 
allowable variability within those groups narrows. In his classic work 
Phylogenetic Systematics, Hennig quoted another paleontologist 
approvingly who observed: “The breadth of evolution of successive 
groups shows a distinct narrowing, since the basic divergences of 
organization became progressively smaller. The type of mammals is 
more uniform and closed than that of the reptiles, which in turn is 
unquestionably uniform compared to that of the Amphibia- 
Stegocephalia.” Hennig goes on to explain that “the same phenomenon 
is repeated in every systematic unit of higher or lower order.”” 

Yet, on a Darwinian view, small-scale variations and differences 
should arise first, gradually giving rise to larger-scale differences in form 
—just the opposite of the pattern evident in the fossil record. Thus, the 
discovery, and later analysis, of the Burgess revealed another puzzling 
feature of the fossil record from a Darwinian point of view, regardless of 
which system of classification paleontologists prefer to use. Indeed, 
Walcott’s discovery turned Darwin’s anticipated bottom-up—or small 
changes first, big changes later—pattern on its head. 


FIRST IMPRESSIONS 


The extraordinary conditions at work in the preservation of the Burgess 
fauna helped to reveal the extent of the rich diversity (and disparity) of 
form present in the Cambrian period. On shale of a very fine grain, the 
Burgess fossils look like lithographic pictures, dark on light (see color 
insert plates 15 and 16). Even the soft parts like the gills and guts are 
sometimes preserved. This is not the norm in the world of paleontology. 
Usually soft tissues decay before they can be fossilized, leaving behind 
only harder parts, such as bone, teeth, and shells, to be preserved. The 
Burgess event that captured the Cambrian fauna for future discovery was 
different. Although it took the lives of untold Cambrian animals, it did 
so with an exquisite delicacy that preserved soft tissue. 

Visualizing how this occurred will drive home why the conditions 
were so unusual. All of the fossilized animals of the Burgess Shale were 
sea creatures living near an enormous carbonate reef that later was thrust 
upward by plate tectonic activity to form what is now called the 
Cathedral Escarpment. Long after the sea creatures of the Burgess Shale 
were entombed, these tectonic forces drove the fossils upward from the 
seafloor carrying them many miles eastward along faults, at the same 
time building the mountains that Walcott would climb millions of years 
later. 

Thanks to this tectonic movement of earth’s major plates, the 
continents are now located in very different places than they were 
millions of years ago. At the time these Cambrian creatures were alive, 
the land masses that would later form North America lay on the equator. 

Plate tectonic activity explains why a trove of sea creatures were 
found fossilized in the mountains of Yoho National Park rather than 
along a seafloor somewhere. But there’s still the question of why so 
many different types of marine invertebrates, including soft-bodied ones, 
were so unusually well preserved. Paleontologists think they know the 
answer. They think the marine animals that were later fossilized in the 
Burgess Shale lived near the bottom of an ancient sea in front of an 
underwater cliff or escarpment. Due to tectonic activity, blocks at the 
edge of this underwater cliff began to break off. These blocks slumped, 


creating underwater mudflows in their wake. These slumps and flows 
transported the Burgess animals several kilometers into deeper waters 
where they were buried in such a way as to leave them not only 
undamaged, but also protected from scavengers and bacteria. Very 
probably, the mudflows were highly turbulent, for paleontologists found 
the creatures dumped and preserved in a variety of angles in relation to 
the bedding. The speed and pressure of these mudflows quickly 
produced a preservation-friendly, oxygen-free environment. Then the 
turbulent and muddy currents pressed fine silt and clay into the crevices 
of the bodies at just the right consistency and pressure to fossilize them 
without tearing their delicate appendages, an ideal set of circumstances 
for ensuring later observation by future paleontologists.” 

Due in part to the unusual circumstances under which these fossils 
were preserved, there’s now little doubt about the unparalleled disparity 
of the Cambrian fauna. Based on available evidence from the Burgess 
Shale and other sites around the world, the Cambrian period witnessed 
the appearance of, arguably, more disparate body plans than ever before 
or since. And this disparity arose at a most unexpected time, assuming 
Darwinian theory, namely, right at the dawn of animal life. 


THE ARTIFACT HYPOTHESIS 


Walcott grasped these difficulties, and had a deep enough professional 
commitment to Darwinism, to search for a solution. He realized that the 
Precambrian fossil record could, in principle, assist in explaining the 
pattern of the Cambrian fossil record. The discovery of a rich 
Precambrian fossil history detailing variations accumulating little by 
little would serve to cast the pattern of the Burgess in a different light. 
Yet the Precambrian strata of his day showed no signs of providing any 
obvious transitional forms, much less a well-articulated bottom-up 
pattern of animals representing lower taxa proliferating into forms 
exemplifying higher and higher taxonomic categories. Nevertheless, an 
idea occurred to Walcott that gave him fresh hope. 


Perhaps his awareness of the dramatic ways that the surface of the 
earth had changed over geologic time, making preservation of the 
Burgess fauna itself possible, inspired him. Finding marine animals so 
high above sea level no doubt made Walcott acutely aware of the way in 
which continents and seas had changed locations over the course of 
geological time. And so, Walcott, ever the geologist, proposed an 
ingenious geological solution to the biological problem of the origin of 
animal life. He noted that the Precambrian period was a period of 
dramatic continental uplift. He then suggested that the ancestors of the 
trilobites first evolved at a time when the Precambrian seas had receded 
from the landmasses. Then, at the beginning of the Cambrian, the seas 
again rose, covering the continents and depositing recently evolved 
trilobites. Thus, according to Walcott, ancestral precursors to the 
trilobites and other distinctive Cambrian forms had existed, but they 
were not fossilized in sediments that would later be elevated above sea 
level until early in the Cambrian; instead, before the Cambrian, during a 
period when sea levels were lower, trilobites and their ancestral forms 
were being deposited offshore in what are now only deep-sea 
sediments.” Walcott named this cryptic period of time in which 
trilobites and other animals were rapidly evolving offshore as the 
“Lipalian interval.” (The term “Lipalian” is derived from the Greek 
word for lost.) In this view, the abrupt appearance of the Cambrian body 
plans in the geological column was merely an “artifact” of incomplete 
sampling of the fossil record and, indeed, the inability to access the 
undersea sedimentary layers where the ancestors of the Cambrian fauna 
presumably lay encased. In short, the transgression and regression of 
ancient seas made the ancestral precursors of the Cambrian fauna 
inaccessible to discovery. 

His artifact hypothesis (also known as the “Lipalian interval” 
hypothesis) was a distinct advance over Darwin’s unadorned claim that 
the fossil ancestors of the Cambrian animals had not yet been 
discovered. Walcott’s hypothesis had the advantage of accounting for the 
sudden appearance of the trilobites and the absence of ancestral and 
transitional forms by reference to known geological processes. It also 


could be tested, at least once offshore drilling technology advanced to 
allow for the sampling of the buried offshore sedimentary rocks. 

Although Walcott conceded that his hypothesis was essentially a 
negative argument that attempted to explain away the absence of 
evidence, he insisted that it was a sensible inference from his broad 
sampling of the paleontological data. “I fully realize that the conclusions 
above outlined are based primarily on the absence of a marine fauna in 
Algonkian [ Precambrian] rocks,” he wrote, “but until such is discovered 
I know of no more probable explanation of the abrupt appearance of the 
Cambrian fauna than that I have presented.””> 


LUMPING AND SPLITTING 


Walcott used another strategy for squaring the Burgess Shale with 
Darwin’s theory of evolution. Taxonomists, tasked with identifying and 
naming distinct groups of life-forms, have been divided into two types: 
“lumpers” and “splitters.” “Lumpers” tend to group disparate organisms 
together in the same large classificatory categories and then make 
distinctions between them at lower taxonomic levels. “Splitters” tend to 
separate similar organisms into numerous higher taxonomic divisions. 
Walcott favored lumping, and his doing so with the Burgess fossils 
seemingly minimized the difficulties associated with the sudden 
proliferation of so many new Cambrian forms. 

On his return to the Smithsonian, he placed all of the exotic forms of 
the Burgess into modern phyla. One of his efforts at lumping placed 
Marrella splendens not only in the same phylum, but also in the same 
class (Trilobita) as the trilobites, despite obvious morphological 
differences. He justified this classification by arguing that the organism 
foreshadowed the trilobite (compare Figs. 1.4 and 2.3). Gould later 
criticized Walcott’s method of classification as “shoehorning.” He noted 
that even one of Walcott’s fellow lumpers, Yale paleontologist Charles 
Schuchert, called the classification of Marrella into question.*° Gould 
also noted that Walcott used this strategy to minimize the challenge 
posed by the morphological disparity of the Burgess forms.’ 


Some paleontologists today reject Gould’s criticism of Walcott’s 
inclusion of so many Burgess animal forms into existing taxonomic 
categories.Nevertheless, few paleontologists think Walcott’s use of 
lumping explained away the Cambrian explosion. Most, for example, 
classify Marrella splendens within an existing modern phylum, namely, 
Arthropoda, even if they also classify it within a new and separate class, 
Marrellomorpha. Yet, whether Marrella, for example, falls within a 
novel phylum or class, matters less than explaining why so many clearly 
novel forms, and the novel structures these forms exhibit, first arose with 
such apparent suddenness. 


RESOLUTION—FOR A TIME 


Walcott thought that he had solved the mystery of the Cambrian 
explosion, as did many other Darwinists who gratefully adopted both his 
taxonomy and his version of the artifact hypothesis. And since Walcott’s 
approach held out hope of one day uncovering evidence of a 
Precambrian trunk for the animal phyla along with its primary limbs, 
adherents could not be accused of moving the paleontological case for 
Darwinism into the realm of untestable dogma. They had only to wait 
for the technologies of seafloor drilling to emerge and hope that nature 
had seen fit, unmolested under the ocean deeps, to leave concrete 
evidence of the gradual emergence of the major Cambrian body plans. 

Walcott’s theoretical accomplishment was no mean feat. His 
discovery of the Burgess Shale was like a defense attorney with absolute 
faith in his client stumbling upon a room stuffed with clues that would 
seem to discredit him. Through his grouping of disparate body types into 
existing phyla and his ingenious version of the artifact hypothesis, 
Walcott had found an elegant way to explain all this seemingly 
uncooperative evidence in a Darwinian way. 

In defending Walcott for overlooking significant features of the 
Burgess fossils, Gould points out that Walcott’s multiple and growing 
administrative demands hardly left time to revisit the foundational 
categories of animal taxonomy. How much less, then, was Walcott likely 


to revisit the most fundamental assumption of all—the assumption that 
animals originated gradually in a Darwinian way as the result of natural 
selection acting on small incremental variations? Consideration of that 
possibility would come decades later, only after Walcott’s version of the 
artifact hypothesis had itself exploded. 


3 
Soft Bodies and Hard Facts 


In the spring of 2000, Discovery Institute, where I do my research, 
sponsored a lecture at the University of Washington geology department 
by renowned Chinese paleontologist J. Y. Chen (see Fig. 3.1). As the 
result of his role in excavating a new discovery of Cambrian-era fossils 
in southern China, Professor Chen’s standing in the scientific world was 
on the rise. The discovery, near the town of Chengyjiang in the Kunming 
Province, revealed a trove of early Cambrian animal forms. After Time 
magazine mentioned the Chengjiang discovery in a 1995 cover story 
about the Cambrian explosion,! interest in the fossils surged. When he 
came to Seattle, Professor Chen had already published numerous 
scientific papers about this profusion of novel life-forms and had 
established himself as one of the foremost experts on the fossils in this 
unique geological setting. 

Not surprisingly, Chen’s visit generated considerable interest among 
the University of Washington faculty. He came bearing intriguing 
photographs and samples of the oldest and most exquisitely preserved 
Cambrian fossils in the world from an exotic site halfway around the 
globe, a site, moreover, that was now widely acknowledged to surpass 
even the legendary Burgess Shale as the most extensive and significant 
Cambrian-era locality. 

The fossils from the Maotianshan Shale near Chengjiang (see Fig. 
3.2) had established an even greater variety of Cambrian body plans 
from an even older layer of Cambrian rock than those of the Burgess, 
and they did so with an almost photographic fidelity. The Chinese fossils 


also helped to establish that the Cambrian animals appeared even more 
explosively than previously realized. 
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FIGURE 3.1 
J. Y. Chen. 
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FIGURE 3.2 

Figure 3.2a (left) shows the Moatianshan Shale outcrop. Courtesy 
Illustra Media. Figures 3.2b and c (center and right) show a 
Precambrian-Cambrian boundary marker at the Moatianshan site. 
Courtesy Paul Chien. 


So there was little doubt about the significance of the discoveries that 
Chen came to report that day. What was soon in doubt, however, was 
Chen’s scientific orthodoxy. In his presentation, he highlighted the 
apparent contradiction between the Chinese fossil evidence and 
Darwinian orthodoxy. As a result, one professor in the audience asked 
Chen, almost as if in warning, if he wasn’t nervous about expressing his 
doubts about Darwinism so freely—especially given China’s reputation 
for suppressing dissenting opinion. I remember Chen’s wry smile as he 
answered. “In China,” he said, “we can criticize Darwin, but not the 
government. In America, you can criticize the government, but not 
Darwin.” 

Nevertheless, those in the audience that day soon learned that 
Professor Chen had good reasons for questioning Darwin’s picture of the 
history of life. As Chen explained, the Chinese fossils turn Darwin’s tree 
of life “upside down.” They also cast doubt on a surviving version of 
Charles Walcott’s artifact hypothesis, a crucial prop in the case for 
Darwinian gradualism. 


BURGESS REVISITED 


By the time Charles Walcott finished his last excavation of the Burgess 
Shale in 1917, he and his team had collected over 65,000 fossil 
specimens, all of which were shipped to the Natural History Museum at 
the Smithsonian Institution for cataloguing. In 1930, another American 
paleontologist, Harvard professor Percy Raymond, initiated another 
investigation of the Burgess. His specimens were also eventually stored 
in the United States. 

As a result of these two prominent American-led excavations, there 
were initially no collections of Burgess fossils on public display in 
Canada. Many Canadian scientists regarded this as a national 
embarrassment, so in the 1960s the Canadian Geological Survey 
commissioned a British team to resume digging at the Walcott quarry, in 
order to “repatriate the Burgess Shale” by keeping most of the newly 
discovered fossils on permanent display in Canada.” The team was led 


by paleontologist Harry Whittington (see Fig. 3.3), of the University of 
Cambridge, who was assisted by two of his graduate students, Simon 
Conway Morris and Derek Briggs, both of whom would eventually 
distinguish themselves as internationally renowned experts on the 
Burgess Shale. 
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FIGURE 3.3 
Harry Whittington. Courtesy Archives of the Museum of Comparative 
Zoology, Ernst Mayr Library, Harvard University. 


As Whittington analyzed the Cambrian fauna at the Burgess, he 
realized that Walcott had grossly underestimated the morphological 
disparity of this group of animals. Many of the creatures in the 
assemblage featured unique body designs, unique anatomical structures, 
or both. Opabinia, with its five eyes, fifteen distinct body segments, 
and a claw at the end of a long proboscis, exemplified the unique forms 
on display in the Burgess. But so did Hallucigenia, Wiwaxia, Nectocaris, 
and many other Burgess animals. To this day, paleontologists describing 
Nectocaris, for example, can’t decide whether it more closely resembles 
an arthropod, a chordate, or a cephalopod (a class of mollusk; see Fig. 
3.4). 

Whittington found that grouping such forms within well-established 
taxonomic categories, even higher taxonomic categories such as the 
class or phylum, strained the limits of these classifications. Even many 
of those animals that fell easily into existing phyla represented clearly 
unique subphyla or classes of organisms. Anomalocaris (literally, 
“abnormal shrimp”) and Marrella, for example, both had hard 
exoskeletons and clearly represent either arthropods or creatures closely 
related to them. Yet each of these animals possessed many distinct 
anatomical parts and exemplified different ways of organizing these 
parts, thus clearly distinguishing themselves from better-known 
arthropods such as the previous staple of Cambrian paleontological 
studies, the trilobite. 

Whittington, a trilobite expert, understood this as well as anyone. In 
1971, he published the first comprehensive taxonomic review of the 
Burgess biota. In his review, he broke decisively with Walcott’s previous 
attempt to lump all Cambrian forms into a few preexisting taxonomic 
categories. 





FIGURE 3.4 

Figure 3.4a (left): Artist rendering of Nectocaris. Figure 3.4b (middle, 
right): Photograph of Nectocaris fossil. Reprinted by permission from 
Macmillan Publishers Ltd.: Nature, Martin R. Smith and Jean-Bernard 
Caron, “Primitive Soft-Bodied Cephalopods from the Cambrian,” 
Nature, 465 (May 27, 2010): 469-72. Copyright 2010. 


In so doing, he reemphasized the morphological disparity present in 
the Burgess animal biota and, in the process, deprived evolutionary 
biologists of one part of Walcott’s two-part strategy for minimizing the 
Cambrian problem. By lumping all Burgess animals into existing phyla 
and classes, Walcott had seemingly diminished the problem of disparity 
by reducing the number of novel phyla for which connecting 
intermediates were required. By recognizing the disparity clearly on 
display in the Burgess, Whittington partially undercut Walcott’s solution 
to the Cambrian mystery and highlighted what would become its central 
unsolved problem: the origin of novel biological form. 


SOUNDING THE SEAFLOOR 


Although Whittington, and later Gould, rejected Walcott’s early attempt 
to “shoehorn” all the animals of the Burgess Shale into preexisting 


taxonomic categories, many paleontologists now also reject Stephen Jay 
Gould’s characterization of many Burgess Shale creatures as being so 
exotic as to defy affinity in classification with any modern groups.* 
Many of these paleontologists would also recognize fewer total phyla 
first appearing in the Cambrian than Gould did, and perhaps even as few 
as Walcott did. As discussed in the previous chapter, still other 
paleontologists now favor “rank-free” approaches to classification. 

Regardless, most paleontologists recognize that the Burgess Shale 
attests to an extraordinary profusion of new animal forms—including 
many manifesting unique anatomical structures and arrangements of 
body parts. Thus, whatever differences of opinion exist about how to 
classify these animals—and any five-year-old child can distinguish them 
from each other and from all previously known forms of life—their 
origin still requires explanation. Thus, as noted, Walcott’s use of 
“lumping” did not solve the Cambrian mystery.” 

But what of the second part of Walcott’s proposal, the artifact 
hypothesis? To evaluate this hypothesis, Walcott devised a more clear- 
cut and less subjective test. Recall that Walcott argued that the ancestral 
precursors of the Cambrian animals were missing from the Precambrian 
fossil record because of the transgression and regression of seas. He 
posited an interval of geologic time in which the ancestors of the 
Cambrian fauna were evolving offshore in a Precambrian ocean and 
being deposited only in layers of marine sedimentary rock. In this 
hypothesis, only after the ancient ocean rose and covered the continent 
were the remains of Cambrian sea animals preserved in sediment that 
today is above sea level. 

When Walcott proposed his ingenious geological scenario, it could 
not yet be tested. But with the development of offshore drilling 
technology in the 1940s, 1950s, and 1960s, oil companies began to drill 
through thousands of feet of marine sedimentary rock.® As geologists 
evaluated the contents of these drill cores, they did not find Walcott’s 
predicted Precambrian fossils. 

Instead, an even more fundamental problem for the hypothesis arose. 
At the time that Walcott proposed his version of the artifact hypothesis, 


geologists considered the oceanic and continental plates to be essentially 
stable and fixed with respect to one another. Mountain building, faulting, 
and other geologic processes were attributed to worldwide changes in 
sea level, accumulating troughs of sediment called geosynclines, rising 
mounds of igneous rocks from beneath the earth’s crust, and even a 
shrinking earth.’ 

The idea that enormous, solid plates actually move, recycling 
themselves through the plate-tectonic processes of subduction and 
seafloor spreading, had not yet been proposed. Yet modern plate tectonic 
theory now affirms that oceanic crustal material eventually plunges back 
into the earth and melts in a process known as subduction. After surface 
rocks melt during subduction, they form a new supply of molten magma. 
Eventually, magma from other locations deep in the earth wells up at 
mid-oceanic ridges to form new igneous rocks, in a process known as 
seafloor spreading. It follows that any oceanic sediments deposited atop 
the oceanic igneous crust will have a limited “life span” on the surface 
of the earth. Eventually, these sedimentary rocks collide with the 
continental margin, plunge deep into the upper mantle, and melt to form 
magma. 

As a consequence of this cycle, the maximum age of any marine 
sediment is strictly limited. And according to modern estimates, the 
oldest section of oceanic crust has existed only since the Jurassic (or 
about 180 million years ago®)—far too young to contain fossil ancestors 
of the trilobites. As the evidence for plate tectonics mounted, scientists 
discarded Walcott’s artifact hypothesis and Lipalian interval as 
nonstarters. Paleontologists today do not expect to find any Precambrian 
ancestors of the trilobites in oceanic sediments, since they realize that 
there are no Precambrian sediments in the ocean basins. If Precambrian 
strata are to be found anywhere, continents are the place. 


OTHER VERSIONS OF THE ARTIFACT HYPOTHESIS 


Although Walcott’s proposals to explain away the absence of fossilized 
ancestors of the Cambrian animals came to naught, other versions of the 


artifact hypothesis continued to circulate. These proposals take two basic 
forms. Some scientists claimed, though for different reasons, that the 
expected Precambrian fossil ancestors had simply not yet been found— 
that missing fossils were an artifact of the incomplete sampling of the 
fossil record. Others suggested that Precambrian sedimentary rocks had 
not preserved the missing fossils—that the incomplete preservation of 
the Precambrian animals meant the missing fossils were no longer there 
to be found. 

Walcott rejected the idea that paleontologists simply had not looked 
in, or sampled, enough places. He noted that geologists already had 
extensively investigated “the great series of Cambrian and Precambrian 
strata in eastern North America.” Though they had looked “from 
Alabama to Labrador; in western North America [and] from Nevada and 
California far into Alberta and British Columbia, and also in China” 
their investigations had turned up nothing of significant interest.? In 
Walcott’s view the continents simply had not preserved the fossilized 
remains of the Cambrian ancestors. 

Before Walcott, some geologists had gone a step farther and 
suggested that all Precambrian sedimentary rocks had been destroyed 
via extreme heat and pressure, a process called “universal 
metamorphism.” Walcott rejected this hypothesis, since he himself had 
encountered a “great series of pre-Cambrian sedimentary rocks on the 
North American continent” among other places. Other geologists 
suggested that major bursts of evolutionary innovation occurred only 
during periods when sedimentary deposition had ceased, thus again 
resulting in a lack of fossil preservation. But, as Gould remarked of 
Walcott’s artifact hypothesis, this explanation also appeared to many 
scientists “forced and ad hoc . . . born of frustration, rather than the 
pleasure of discovery.”!° 


CONTEMPORARY VERSIONS OF THE ARTIFACT 
HYPOTHESIS: TOO SOFT OR TOO SMALL 


After the demise of the “universal metamorphism” idea, some 
paleontologists proposed simpler, more intuitively plausible versions of 
the artifact hypothesis. They claimed that the proposed intermediate 
forms leading to the Cambrian animals may have been either too small 
or too soft, or both, to have been preserved. 

Developmental biologist Eric Davidson, of California Institute of 
Technology, has suggested that the transitional forms leading to the 
Cambrian animals were “microscopic forms similar to modern marine 
larvae” and were thus too small to have been reliably fossilized.'! Other 
evolutionary scientists, such as Gregory Wray, Jeffrey Levinton, and Leo 
Shapiro, have suggested that the ancestors of the Cambrian animals were 
not preserved, because they lacked hard parts such as shells and 
exoskeletons.!* They argue that since soft-bodied animals are difficult to 
fossilize, we shouldn’t expect to find the remains of the supposedly soft- 
bodied ancestors of the Cambrian fauna in the Precambrian fossil record. 
University of California, Berkeley, paleontologist Charles R. Marshall 
summarizes these explanations: 


[I]t is important to remember that we see the Cambrian 
“explosion” through the windows permitted by the fossil and 
geological records. So when talking about the Cambrian 
“explosion, ” we are typically referring to the appearance of 
large-body (can be seen by the naked eye) and preservable (and 
therefore largely skeletonized) forms. . . . If the stem lineages were 
both small and unskeletonized, then we would not expect to see 
them in the fossil record.” 


Though intuitively plausible, several discoveries call into question 
both of these versions of the artifact hypothesis. As for the idea that the 
ancestors of the Cambrian animals were too small to be preserved, 
paleontologists have known for some time that the cells of filament- 
shaped microorganisms (probably cyanobacteria) have been preserved in 
ancient Precambrian rocks. Paleobiologist J. William Schopf, of the 
University of California, Los Angeles, has reported an extremely ancient 


example of these fossils in the Warrawoona Group strata of western 
Australia. These fossilized cyanobacteria are preserved in 3.465-billion- 
year-old bedded cherts (microcrystalline sedimentary rocks).'* The same 
strata have also preserved stromatolite mats, an organic accretionary 
growth structure usually indicating the presence of bacteria, within 
slightly younger dolostone sediments of roughly 3.45 billion years in age 
(see Fig. 3.5).!° 





FIGURE 3.5 

Figure 3.5a (left): Photographs of Cambrian-age fossil stromatolites. 
Courtesy Wikimedia Commons, user Rygel, M. C. Figure 3.5b (right): 
Alternating fine- and coarse-layered structure of Precambrian 
stromatolite fossils shown in cross section. Courtesy American 
Association for the Advancement of Science, Figure 2B, Hoffman, P., 
“Algal Stromatolites: Use in Stratigraphic Correlation and Paleocurrent 
Determination,” Science, 157 (September 1, 1967): 1043—45. Reprinted 
with permission from AAAS. 


These discoveries pose a problem for the idea that the Cambrian 
ancestors were too small to survive in the fossil record. The sedimentary 
rocks that preserve the fossilized cyanobacteria and single-celled algae 
are far older and, therefore, far more likely to have been destroyed by 
tectonic activity than those later sedimentary rocks that should have 
preserved the near-ancestors of the Cambrian animals. Yet these rocks, 
and the fossils they contain, have survived just fine. If paleontologists 
can find tiny fossilized cells in these far older and rarer formations, 
shouldn’t they also be able to find some ancestral forms of the Cambrian 


animals in younger and more abundant sedimentary rocks? Yet few such 
precursors have been found. 

There are also several reasons to question the second version of this 
hypothesis—the idea that the presumed Cambrian ancestors were too 
soft to be preserved. First, some paleontologists have questioned 
whether soft-bodied ancestral forms of the hard-bodied Cambrian 
animals would have even been anatomically viable.'° They argue that 
many animals representing phyla such as brachiopods and arthropods 
could not have evolved their soft parts first and then added shells later, 
since their survival depends upon their ability to protect their soft parts 
from hostile environmental forces. Instead, they argue that soft and hard 
parts had to arise together.!’ As paleontologist James Valentine, of the 
University of California, Berkeley, has noted in the case of brachiopods, 
“The brachiopod Bauplan [body plan] cannot function without a durable 
skeleton.”!® Or as J. Y. Chen and his colleague Gui-Qing Zhou observe: 
“Animals such as brachiopods . . . cannot exist without a mineralized 
skeleton. Arthropods bear jointed appendages and likewise require a 
hard, organic or mineralized outer covering.””!” 

Because these animals typically require hard parts, Chen and Zhou 
assume that the ancestral forms of these animals should have been 
preserved somewhere in the Precambrian fossil record if in fact they 
were ever present. Thus, the absence of hard-bodied ancestors of these 
Cambrian animals in the Precambrian strata shows that these animals 
first arose in the Cambrian period. As they rather emphatically insist: 
“The observation that such fossils are absent in Precambrian strata 
proves that these phyla arose in the Cambrian.””° 

It should be pointed out that this argument cannot be made for all 
Cambrian animal groups and, in my view, does not achieve the standing 
of a “proof? in any case. Many Cambrian phyla, including phyla 
characterized by mostly hard-shelled animals such as mollusks and 
echinoderms, do have soft-bodied representatives. The earliest known 
mollusk, Kimberella, for example, lacked a hard external shell (though it 
did have other hard parts).?! So, clearly, some mainly hard-shelled 
Cambrian groups could have had soft-bodied ancestors. 


It is also possible to postulate the existence of an arthropod or 
brachiopod ancestor—especially some extremely distant ancestor— 
lacking a hard shell. Soft-bodied onychophorans (velvet worms) were 
once proposed as ancestors of the arthropods, though more recent studies 
challenge this idea. Onychophorans themselves arise well after 
arthropods in the fossil record and cladistics analysis suggests that 
onychophorans may be a sister, rather than an ancestral, group to 
arthropods.”* Even so, it’s difficult to disprove a negative: in particular, 
to foreclose the possibility that arthropods or brachiopods might have 
had a soft-bodied ancestor deep in the Precambrian. 

Nevertheless, it seems unlikely on a Darwinian view of the history of 
life that all Cambrian arthropod or brachiopod ancestors, especially the 
relatively recent ancestors of these animals, would have lacked hard 
parts entirely. There are many types of arthropods that arise suddenly in 
the Cambrian—trilobites, Marrella, Fuxianhuia protensa, Waptia, 
Anomalocaris—and all of these animals had hard exoskeletons or body 
parts. Moreover, the only known extant group of arthropods without a 
hard exoskeleton (the pentastomids) have a parasitic relationship with 
arthropods that do.” Thus, surely, it seems likely that some of the near 
ancestors of the many arthropod animals that arose in the Cambrian 
would have left at least some rudimentary remains of exoskeletons in the 
Precambrian fossil record—if, in fact, such ancestral arthropods existed 
in the Precambrian and if arthropods arose in a gradual Darwinian way. 

Moreover, the arthropod exoskeleton is part of a tightly integrated 
anatomical system. Specific muscles, tissues, tendons, sensory organs— 
and a special mediating structure between the soft tissue of the animal 
and the exoskeleton called the endophragmal system—are all integrated 
to support the process of molting and exoskeletal growth and 
maintenance that is integral to the arthropod mode of existence. A best- 
case Darwinian scenario for the origin of such a system would, 
therefore, envision the “co-evolution” of these separate anatomical 
subsystems in a coordinated fashion, since some of these anatomical 
subsystems confer a functional advantage to the animal largely by 
supporting, and promoting, the growth and maintenance of the 


exoskeleton (and vice versa). Others would be vulnerable to damage 
without it. Thus, it seems unlikely that these interdependent subsystems 
would evolve independently first without an exoskeleton, only to have 
the exoskeleton arise suddenly as a kind of accretion atop an already 
integrated system of soft parts at the end of a long evolutionary process. 

This, again, makes it reasonable to expect that at least some 
rudimentary arthropod hard parts would have been preserved in the 
Precambrian if arthropods were present then. That such parts are 
unknown for all Cambrian arthropods (and brachiopods) in a fossil 
record that presumably favors hard-part preservation, seems at least 
curious. And it appears, on its face, to support the assertions of those 
Cambrian paleontologists such as Chen and Zhou who take the absence 
of any hard parts in the Precambrian record as evidence of the absence 
of those groups that typically depend on hard parts for their existence. 

In any case, advocates of the artifact hypothesis must at least explain 
a Cambrian explosion of hard body parts, if not whole Cambrian 
animals. As paleontologist George Gaylord Simpson noted in 1983, even 
if it’s true that Precambrian ancestors were not preserved simply because 
they lacked hard parts, “there is still a mystery to speculate about: Why 
and how did many animals begin to have hard parts—skeletons of sorts 
—with apparent suddenness around the beginning of the Cambrian?” 

There is an additional, more formidable, difficulty for this version of 
the artifact hypothesis. Although the fossil record generally does not 
preserve soft body parts as frequently as hard parts, it has preserved 
many soft-bodied animals, organs, and anatomical structures from both 
the Cambrian and the Precambrian periods. 

As we saw earlier, Precambrian sedimentary rocks in several places 
around the world have preserved fossilized colonial blue-green algae, 
single-celled algae, and cells with a nucleus (eukaryotes).*? These 
microorganisms were not only small, but they also entirely lacked hard 
parts. Another class of late Precambrian organisms called the Vendian or 
Ediacaran biota included the fossilized remains of many soft-bodied 
organisms, including many that may well have been lichens, algae, or 
protists (microorganisms with cells containing nuclei). Cambrian-era 


strata themselves preserve many soft-bodied creatures and structures. 
The Burgess Shale in particular preserved the soft parts of several types 
of hard-bodied Cambrian animals, such as Marrella splendens,*® 
Wiwaxia,*’ and Anomalocaris. The Burgess Shale also documents 
entirely soft-bodied representatives?’ of several phyla, including: 


e Cnidaria (represented by an animal called Thaumaptilon, a 
feather-shaped colonial organism formed from smaller soft sea 
anemone-like animals)?’ 

e Annelida (represented by the polychaete worms Burgessochaeta 
and Canadia)*® 

e Priapulida (represented by Ottoia, Louisella, Selkirkia—all 
worms with a distinctive proboscis)?! 

e Ctenophora (represented by Ctenorhabdotus, a gelatinous 
animal with a translucent body similar to a modern comb jelly)? 

e Lobopodia (represented by Aysheaia and Hallucigenia, 
segmented soft-bodied animals with many legs)?’ 


The Burgess also preserves soft-bodied animals of unknown 
affinities, such as Amiskwia, a gelatinous air mattress—like animal;** 
Eldonia, a jellyfish-like animal with a much more complex anatomy 
than a modern jellyfish;*> and the aforementioned, difficult to classify, 
Nectocaris.°° As Simon Conway Morris notes, “The existing [Burgess] 
collections represent approximately 70,000 specimens. Of these, about 
95 percent are either soft-bodied or have thin skeletons.”?7 


THE CHENGJIANG EXPLOSION 


Any doubts about the ability of sedimentary rocks to preserve soft and 
small body parts were permanently laid to rest by a series of dramatic 
fossil finds in southern China beginning in the 1980s. 

In June 1984, paleontologist Xian-Guang Hou journeyed to 
Kunming, in southern China, to prospect for fossilized samples of a 


bivalved arthropod called a bradoriid.** The area around Kunming in the 
Yunnan Province was well known for its lower Cambrian strata and 
typical Cambrian-era fossils, such as bradoriids and trilobites, both of 
which were relatively easy to preserve because of their characteristically 
hard exoskeletons. In 1980, Hou had found many bradoriid samples in a 
geological formation called the Qiongzhusi Section near Kunming. 

In the summer of 1984 Hou traveled to the town of Chengyjiang to 
look for bradoriids in another geological formation called the Heilinpu 
Formation. His efforts there yielded little success. As a result, he turned 
his attention to another outcrop, a sedimentary sequence now called the 
Maotianshan Shale. Hou’s team set farmworkers to digging out and 
scouring the mudstone blocks. His book, The Cambrian Fossils of 
Chengjiang, China, describes what happened next: 


At about three o’clock in the afternoon of Sunday July 1, a 
semicircular white film was discovered in a split slab, and was 
mistakenly thought to represent the valve of an unknown 
crustacean. With the realization that this . . . represented a 
previously unreported species, breaking of the rock in a search for 
additional fossils continued apace. With the find of another 
specimen, a 4—5 cm long animal with limbs preserved, it became 
apparent that here was nothing less than a soft-bodied biota.” 


Hou remembers the Cambrian specimen vividly, for it appeared “as if 
it was alive on the wet surface of the mudstone.’”*? Redoubling their 
efforts, the researchers quickly uncovered the fossilized remains of one 
extraordinary soft-bodied animal after another. Most of the fossils were 
preserved as flattened two-dimensional imprints of three-dimensional 
organisms, although, as Hou observes, “some retain a low three- 
dimensional relief.”*! Most important, he notes, “The remains of hard 
tissues, such as the shells of brachiopods or the carapaces of trilobites, 
are well represented in the Chengjiang fauna, but less robust tissues, 
which are usually lost through decomposition, are also beautifully 
preserved.’42 


As the result of the very fine, small-grained sediments in which they 
were deposited, the Chengjiang fossils preserved anatomical details with 
a fidelity surpassing even that of the Burgess fauna.*? The Maotianshan 
Shale also preserved an even greater variety of soft-bodied animals and 
anatomical parts than the Burgess Shale had done. In the ensuing years, 
Hou and his closest colleagues, J. Y. Chen and Gui-Qing Zhou, found 
many excellent examples of well-preserved animals lacking even a 
keratinized exoskeleton, including soft-bodied members of phyla such as 
Cnidaria (corals and jellyfish), Ctenophora (comb jellies), Annelida (a 
type of “ringed” segmented worm), Onychophora (segmented worms 
with legs), Phoronida (a tubular, filter-feeding marine invertebrate), and 
Priapulida (another distinctive type of worm).** (See Fig. 3.6.) 

They found fossils preserving the anatomical details of numerous soft 
tissues and organs such as eyes, intestines, stomachs, digestive glands, 
sensory organs, epidermes, bristles, mouths, and nerves.*° They also 
discovered jellyfish-like organisms called Eldonia, which exhibit 
delicate, soft body parts such as radiating water canals and nerve rings. 
Other fossils even revealed the contents of the guts of several animals.*° 

The discoveries near Chengjiang demonstrated beyond any 
reasonable doubt that sedimentary rocks can preserve soft-bodied fossils 
of great antiquity and in exquisite detail, thereby challenging the idea 
that the absence of Precambrian ancestors is a consequence of the fossil 
record’s inability to preserve soft-bodied animals from that period. At 
the same time, the sedimentary rocks near Chengjiang had other 
surprises in store. 


PRECAMBRIAN SECRETS 


Paul Chien is a Chinese-American marine biologist who, as a boy, left 
mainland China with his family to escape the 1949 Communist takeover 
under Mao Tse-tung. Eventually, after completing Ph.D. studies in the 
United States, he became a biology professor at the University of San 
Francisco. He first learned about the discoveries in southern China after 
reading the Time magazine cover story in 1995. Then he learned, 


ironically, from a story in Peoples Daily, the official newspaper of the 
Chinese Communist party, that some Chinese paleontologists thought 
that these discoveries challenged a Darwinian view of the history of life. 

The great variety of the marine invertebrates present in Chengjiang 
fascinated Chien and convinced him to return to the country of his birth. 
After making his first trip in the summer of 1996, he met J. Y. Chen. As 
Paul Chien returned over several successive summers to do research of 
his own, he and J. Y. Chen continued to share their findings and compare 
notes. 

Upon arriving in China in 1998 for his third summer of research, Paul 
Chien learned that Chen had discovered a fossil of an adult sponge in the 
late Precambrian rocks of a sedimentary formation called the 
Doushantuo Phosphorite—a formation that lies beneath the Maotianshan 
Shale. As the two scientists examined the sediments that encased Chen’s 
fossil sponge, they made a discovery that would doom the most popular 
remaining version of the artifact hypothesis. 






Figure 3.6c: 
The shrimp- 
like arthropod, 
Waptia. 
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Figure 3.6d: 
A priapulid 
worm with 
its distinctive 
proboscis. 


Figure 3.6a: A 
ciliated comb jelly, 
from the phylum 
Ctenophora. 


Figure 3.6e: 
The enigmatic 
jellyfish-like 
Eldonia. 





Figure 3.6f: 

A conical 
shelled animal 
from the 
phylum 
Hyolitha. 








Figure 3.6b: A filter- 
feeding worm-like 
member of the 
phylum Phoronida. 





FIGURE 3.6 

Cambrian explosion fossils from Chengjiang fauna, artist depictions and 
fossil photos. Photos in 3.6a, b, e, and f courtesy J. Y. Chen. Photos in 
3.6c and d courtesy Paul Chien. 


As J. Y. Chen began to examine the sedimentary rocks that enclosed 
his fossilized sponge, he decided to look at them in a so-called thin 
section under a light microscope. Chen wondered whether smaller 
embryonic forms of these Precambrian animals might also have been 
preserved in these phosphorite rocks. Sure enough, under magnification 
he found little round balls that he and Paul Chien identified as sponge 
embryos. In 1999, at a major international conference about the 
Cambrian explosion held near Chengjiang, J. Y. Chen, Paul Chien, and 
three other colleagues presented their findings.*’ 

A number of Chinese paleontologists questioned them at first, 
suggesting that the little round balls were not sponge embryos at all, but 
instead the remains of brown and green algae.“ Here Paul Chien’s 
expertise came to the fore. Early in his career, Chien had perfected a 
technique for examining the embryos of living sponges under a scanning 
electron microscope. He now adapted his technique to examine these 
microscopic fossilized structures using a more powerful microscope. 
What he found startled him and amazed other scientists. 

Sponges are nature’s glassworks. They are made of a soft and flexible 
lattice of cells from which protrude silica-encrusted “spicules.” Though 
sponges come in a variety of shapes and sizes, they are one of the 
simplest known forms of animal life, with between six and ten 
distinctive cell types.*? In comparison, the typical arthropod has between 
thirty-five and ninety cell types. 

As Chien examined the little balls of the Doushantuo Phosphorite 
under the scanning microscope, he noticed what looked like cells 
undergoing cell division. At first, he had no way to determine what type 
of cells they might be. But as he examined cross sections of these cells 
more carefully, he identified a distinctive structure that he knew from his 
prior research. 


Only sponges have spicules, and the fossilized cells he was 
examining preserved microscopic spicules in the early stages of their 
development.°° Clearly, these were not algal balls; they were sponge 
embryos. Even more surprising, Chien was able to observe the internal 
structure of these embryonic cells, allowing him to identify the nuclei of 
some of these cells within the fossilized remains of the larger outer cell 
membrane (see Fig. 3.7). 

The discovery of these sponge embryos has proven decisive in the 
case against the remaining versions of the artifact hypothesis, for several 
reasons. 

First, though spicules in sponges are encased in a thin layer of glassy 
silica, sponges are generally considered to be a soft-bodied organism 
because of the predominantly soft tissues out of which the rest of their 
bodies are made. Moreover, the cells of all embryos during their earliest 
embryonic stages are soft. Even in animals that have internal or external 
skeletons, the nascent forms of these hard parts do not emerge until 
gastrulation, partway through the process of embryological 
development. Thus, discovery of an embryo in the earliest stages of cell 
division shows beyond a doubt that Precambrian sedimentary rocks can, 
under the right circumstances, preserve soft-bodied organisms. 

It also established something else. J. Y. Chen found these sponge 
embryos beneath the Cambrian—Precambrian boundary in late 
Precambrian rock. Yet these Precambrian layers did not preserve 
remains of any clearly ancestral or intermediate forms leading to the 
other main groups of Cambrian animals. This raised an obvious 
question. If the Precambrian sedimentary strata beneath the Maotianshan 
Shale preserved the soft tissues of tiny, microscopic sponge embryos, 
why didn’t they also preserve the near ancestors of the whole animals 
that arose in the Cambrian, especially since some of those animals must 
have had at least some hard parts as a condition of their viability? If 
these strata could preserve embryos, then they should have preserved 
fully developed animals—at least, if such animals were present at the 
time. That well-developed, clearly ancestral animal forms were not 


preserved, when tiny sponge embryos’! were, strongly indicates that 
such forms were simply not present in the Precambrian layers. 
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FIGURE 3.7 

Figure 3.7a (above): Photographs of fossilized sponge embryos in the 
early stages of cell division, showing the sponge at the 8-cell stage of 
division, with four of its cells marked in the foreground. Figure 3.7b 
(right): A close-up image of a fossil of a sponge embryo cell revealing 
numerous yolk granules inside. Courtesy Paul Chien. 


Of course, there are conditions under which fossils are unlikely to be 
preserved. We know, for example, that near-shore sands do not favor 
preservation of detail, let alone the fine detail of very small organisms a 
millimeter or less in length.°? Even so, such considerations do little to 
bolster the artifact hypothesis. The sedimentary environments that 
produced the carbonates, phosphorites, and shales of the Precambrian 
strata beneath the Maotianshan Shale, for example, would have provided 
a congenial environment for fossilizing all sorts of creatures during 
Precambrian times. 


Advocates of the artifact hypothesis need to show not just that certain 
factors discourage preservation in general. No one disputes that. What 
they need to show is that these factors were ubiquitous in Precambrian 
depositional environments worldwide. If near-shore sands characterized 
all Precambrian sedimentary deposits, then paleontologists would not 
expect to find any fossils there, at least not any tiny ones. Yet clearly this 
is not the case. Precambrian strata include many types of sediments that 
can preserve—and in the case of the Doushantuo formation in China, 
have preserved—animal remains in fine detail, including small and 
vulnerable sponge embryos. 

Moreover, geologists Mark and Dianna McMenamin have noted that 
in many other Cambrian locales around the world, including one in 
Newfoundland that they have studied extensively, the pattern of 
sedimentation changes very little across the Cambrian—late Precambrian 
boundary, suggesting that many Precambrian environments would have 
provided equally good environments for the preservation of fossils.>° 

In their 2013 book, The Cambrian Explosion, paleontologists James 
Valentine and Douglas Erwin go further. They note that many late 
Precambrian depositional environments actually provide more favorable 
settings for the preservation of fossils than those present in the Cambrian 
period. As they note, “a revolutionary change in the sedimentary 
environment—from microbially stabilized sediments during the 
Ediacaran [late Precambrian] to biologically churned sediments as 
larger, more active animals appeared—occurred during the early 
Cambrian. Thus, the quality of fossil preservation in some settings may 
have actually declined from the Ediacaran to the Cambrian, the opposite 
of what has sometimes been claimed, yet we find a rich and widespread 
explosion of [Cambrian] fauna.”>* 


STATISTICAL PALEONTOLOGY 


Recent work in a field known as statistical paleontology casts further 
doubt on the artifact hypothesis. Since the discovery of the Burgess 
Shale, Precambrian- and Cambrian-era discoveries have repeatedly 


uncovered fossil forms that either establish radically disparate new 
forms of life or, increasingly, forms that fall into existing higher 
taxonomic groups (such as class, subphylum, or phylum). 

As a result, the fossil record amply documents organisms 
corresponding to the terminal branches on the Darwinian tree of life 
(animal forms representing new phyla or classes, for example), but it 
fails to preserve those organisms representing the internal branches or 
nodes leading to these representatives of novel phyla and classes of 
Cambrian-era animals. Yet these intermediates are the very forms 
required to connect the terminal branches to form a coherent 
evolutionary tree and establish that the representatives of the Cambrian 
animals did arise by means of a gradual evolutionary process from 
simpler Precambrian ancestors. 

Recall that Louis Agassiz thought that this pattern could not be 
explained by appealing to an incomplete fossil record, because the fossil 
record was strangely selective in its incompleteness, preserving 
abundant evidence of the terminal branches but consistently neglecting 
to preserve the representatives of the internal branches or nodes. 

Contemporary paleontologists, such as Michael Foote at the 
University of Chicago, have come to a similar conclusion. Foote has 
shown, using statistical sampling analysis, that as more and more fossil 
discoveries fall within existing higher taxonomic groups (e.g., phyla, 
subphyla, and classes), and as they fail to document the rainbow of 
intermediate forms expected in the Darwinian view of the history of life, 
it grows ever more improbable that the absence of intermediate forms 
reflects a sampling bias—that is, an “artifact” of either incomplete 
sampling or preservation. 

This kind of analysis merely quantifies what, in other circumstances, 
we would sense intuitively. Imagine that you reach into an enormous 
barrel full of marbles and randomly pull out a yellow, a red, and a blue 
marble. At this point your brief sampling should leave you undecided as 
to whether you have a representative sample of the barrel’s contents. 
You might at first imagine that the barrel also contains marbles 
representing a rainbow of intermediate colors. But as you continue to 


sample from every place in the barrel and find that the barrel disgorges 
only those same three colors you begin to suspect that it may offer a 
much more limited selection of colors than, say, the rack of color 
samples at your local paint store. 

Over the past 150 years or so, paleontologists have found many 
representatives of the phyla that were well known in Darwin’s time (by 
analogy, the equivalent of the three primary colors) and a few 
completely new forms altogether (by analogy, some other distinct colors 
such as green and orange, perhaps). And, of course, within these phyla 
there is a great deal of variety. Nevertheless, the analogy holds at least 
insofar as the differences in form between any member of one phylum 
and any member of another phylum are vast, and paleontologists have 
utterly failed to find forms that would fill these yawning chasms in what 
biologists call “morphological space.” In other words, they have failed 
to find the paleontological equivalent of the numerous finely graded 
intermediate colors (Pendleton blue, dusty rose, gun barrel gray, 
magenta, etc.) that interior designers covet. Instead, extensive sampling 
of the fossil record has confirmed a strikingly discontinuous pattern in 
which representatives of the major phyla stand in stark isolation from 
members of other phyla, without intermediate forms filling the 
intervening morphological space. 

Foote’s statistical analysis of this pattern, documented by an ever 
increasing number of paleontological investigations, demonstrates just 
how improbable it is that there ever existed a myriad of as yet 
undiscovered intermediate forms of animal life—forms that could close 
the morphological distance between the Cambrian phyla one tiny 
evolutionary step at a time. In effect, Foote’s analysis suggests that since 
paleontologists have reached repeatedly into the proverbial barrel, 
sampled it from one end to the other, and found only representatives of 
various radically distinct phyla but no rainbow of intermediates, we 
shouldn’t hold our breath expecting such intermediates to eventually 
emerge. He asks “whether we have a representative sample of 
morphological diversity and therefore can rely on patterns documented 
in the fossil record.” The answer, he says, is yes.>° 


By this affirmation, he doesn’t mean that there are no biological 
forms left to discover. He means, rather, that we have good reason to 
conclude that such discoveries will not alter the largely discontinuous 
pattern that has emerged. “Although we have much to learn about the 
evolution of form,” he writes, the statistical pattern created by our 
existing fossil data demonstrates that “in many respects our view of the 
history of biological diversity is mature.”>° 


CHENGJIANG AND THE CAMBRIAN CONUNDRUM?’ 


The Cambrian and Precambrian fossils from southern China have 
rendered the mystery associated with the Cambrian explosion more 
acute in other ways as well. First, the fossil finds in southern China 
coupled with advances in radiometric dating techniques applied to other 
Cambrian-era strata have allowed scientists to reassess the duration of 
the Cambrian explosion. As the name implies, the fossils documenting 
the Cambrian explosion appear within a relatively narrow slice of 
geologic time. Until the early 1990s, most paleontologists thought the 
Cambrian period began 570 million and ended 510 million years ago, 
with the Cambrian explosion of novel animal forms occurring within a 
20- to 40-million-year window during the lower Cambrian period. 

Two developments have led paleontologists and geochronologists to 
revise those estimates downward. First, in 1993, radiometric dating of 
zircon crystals from formations just above and below Cambrian strata in 
Siberia allowed for a precise redating of Cambrian strata. Radiometric 
analyses of these crystals fixed the start of the Cambrian period at 
544 million years ago,’ and the beginning of the Cambrian explosion 
itself to about 530 million years ago (see Fig. 3.8). These studies also 
suggested that the explosion of the novel Cambrian animal forms 
occurred within a window of geologic time much shorter than previously 
believed, lasting no more than 10 million years, and the main “period of 
exponential increase of diversification” lasting only 5 to 6 million 
years.>” 
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FIGURE 3.8 
The Cambrian explosion occurred within a narrow window of geological 
time. 


Geologically speaking, 5 million years represents a mere 1/10 of 1 
percent (0.11 percent, to be precise) of earth’s history. J. Y. Chen 
explains that “compared with the 3-plus-billion-year history of life on 
earth, the period [of the explosion] can be likened to one minute in 24 
hours of one day.” 

Some geologists or evolutionary biologists dispute these numbers, 
but they do so by redefining the Cambrian explosion as a series of 
separate events rather than using the term to refer to the main radiation 
of new body plans in the lower Cambrian. In 2009, I participated in a 
debate in which one of my opponents, paleontologist Donald Prothero, 
from Occidental College, used this common rhetorical strategy to 
minimize the severity of the Cambrian mystery. In his opening 
statement, he claimed that the Cambrian explosion actually took place 
over an 80-million-year period of time and that consequently those who 
cited the Cambrian as a challenge to the adequacy of the neo-Darwinian 
theory were mistaken. As I was listening to his opening statement, I 
consulted his textbook to see how he had derived his 80-million-year 
figure. Sure enough, he had included in the Cambrian explosion three 


separate pulses of new innovation or diversification, including the origin 
of a group of late Precambrian organisms called the Ediacaran or 
Vendian fauna. He also included not only the origin of the animal body 
plans in the lower Cambrian, but also the subsequent minor 
diversification (variations on those basic architectural themes) that 
occurred in the upper Cambrian. He included, for example, not just the 
appearance of the first trilobites, which occurred suddenly in the lower 
Cambrian, but also the origin of a variety of different trilobite species 
later from the upper Cambrian. 

In my response to Prothero, I noted that he was, of course, free to 
redefine the term “Cambrian explosion” any way he liked, but that by 
using the term to describe several separate explosions (of different 
kinds), he had done nothing to diminish the difficulty of explaining the 
origin of the first explosive appearance of the Cambrian animals with 
their unique body plans and complex anatomical features. Beyond this, 
as we'll see in the next chapter, the Vendian organisms may not have 
been animals at all, and they bear little resemblance to any of the 
animals that arise in the Cambrian. We’ll also see that most, if not all,°! 
of these organisms actually went extinct well before the origin of the 
animals that first appear in the lower Cambrian and so they do little to 
minimize the problem of the explosive origin of animals. 

In any case, expanding the definition of the Cambrian explosion only 
obscures the real challenge posed by the event, a challenge underscored 
by the discoveries at Chengjiang. An analysis by MIT geochronologist 
Samuel Bowring has shown that the main pulse of Cambrian 
morphological innovation occurred in a sedimentary sequence spanning 
no more than 6 million years.°” Yet during this time representatives of at 
least sixteen completely novel phyla and about thirty classes first 
appeared in the rock record. In a more recent paper using a slightly 
different dating scheme, Douglas Erwin and colleagues similarly show 
that thirteen new phyla appear in a roughly 6-million-year window.®* As 
we’ve seen, among these animal forms were the first trilobites, with their 
lens-focusing compound eyes among other complex anatomical features. 
The problem of explaining how so many new forms and structures arose 


so rapidly in the first explosive period of the Cambrian remains, whether 
or not one decides to include within the designation “Cambrian 
explosion” other distinct events (see Part Two). 


THE TOP-DOWN PATTERN IN STARK RELIEF 


The Chengjiang fauna makes the Cambrian explosion more difficult to 
reconcile with the Darwinian view for yet another reason. The 
Chengjiang discoveries intensify the top-down pattern of appearance in 
which individual representatives of the higher taxonomic categories 
(phyla, subphyla, and classes) appear and only later diversify into the 
lower taxonomic categories (families, genera, and species). 

Discoveries at Chengjiang contradict the bottom-up pattern that neo- 
Darwinism expects. The site does not show the gradual emergence of 
unique species followed slowly but surely by the emergence of 
representatives of ever higher and more disparate taxa, leading to novel 
phyla. Instead, like the Burgess Shale, it shows body plan—level disparity 
arising first and suddenly, with no evidence of a gradual unfolding and 
ranging through the lower taxonomic groups. 

Consider the case of the early chordates, a phylum consisting of 
creatures possessing a flexible, rod-shaped structure called a notochord. 
Mammals, fish, and birds are familiar members of this phylum. Prior to 
the discovery of the Chengjiang biota, chordates were unknown in the 
Cambrian period and were thought to have first appeared only much 
later, during the Ordovician period. Now, following the discoveries at 
Chengjiang, the first appearance of chordates in the Cambrian period has 
been amply documented. 

For example, J. Y. Chen and several other Chinese paleontologists 
have found a spindle-shaped eel-like animal called Yunnanozoon 
lividum, which many paleontologists have interpreted as an early 
chordate because it possesses, among other features, a digestive tract, 
branchial arches, and a large notochord much like a spinal cord.® In 
addition, J. Y. Chen and colleagues have reported the discovery of a 
sophisticated craniate-like chordate called Haikouella lanceolata from 


the lower Cambrian Maotianshan Shale. According to Chen and others, 
Haikouella has many of the same features of Yunnanozoon lividum as 
well as several additional anatomical features including a “heart, ventral 
and dorsal aorta, an anterior branchial arterial, gill filaments, a caudal 
(posterior) projection, a neural cord with a relatively large brain, a head 
with possible lateral eyes, and a ventrally situated buccal cavity with 
short tentacles.”°° 





FIGURE 3.9 

Figure 3.9a (top): Drawing of the Cambrian fish, Myllokunmingia 
fengjiaoa. Figure 3.9b (bottom): Photograph of Myllokunmingia 
fengjiaoa fossil. Reprinted by permission from Macmillan Publishers 
Ltd.: Nature, Shu et al., “Lower Cambrian Vertebrates from South 
China,” Nature, 402 (November 4, 1999): 42—46. Copyright 1999. 


Simon Conway Morris, with D. G. Shu and several Chinese 
colleagues, has reported an even more dramatic find. They have 
discovered the fossilized remains of two small Cambrian fish, 
Myllokunmingia fengjiaoa and Haikouichthys ercaicunensis (see Fig. 
3.9), suggesting a much earlier appearance for both fishes and 
vertebrates (a class of chordates), both of which were first thought to 
have originated in the Ordovician period, about 475 million years ago. 
Both of these taxa are jawless fish (agnathans) and are considered by 
Shu and others to be closely allied to modern lampreys.°’ Finally, a 
paper by Shu and others reports the first convincing specimen of another 
type of chordate from the Cambrian, a urochordate (tunicate). This 
specimen, Cheungkongella ancestralis, is likewise found in the early 
Cambrian shales (Qiongzhusi Formation) near Chengjiang. These recent 
finds demonstrate that not only did the phylum Chordata first appear in 
the Cambrian, but also each one of the chordate subphyla 
(Cephalochordata, Craniata, and Urochordata) first emerged then as 
well. In any case, the discovery in China of chordates, and other 
previously undiscovered phyla in the Cambrian, only accentuates the 
puzzling top-down pattern of appearance that other Cambrian 
discoveries had previously established.°? 


MORE QUESTIONS THAN ANSWERS 


Thus, despite the efforts to explain the Cambrian explosion using 
various versions of the artifact hypotheses, the mystery of the Cambrian 
explosion has only grown more acute as a result of the dramatic 


discoveries in southern China that have turned Darwin’s tree of life 
upside down. 

When I first heard J. Y. Chen describe these discoveries in 2000, I 
had been investigating another unsolved question about the history of 
life: What caused the first living cell, and the information it contains, to 
arise? As I heard Dr. Chen speak that day in Seattle, my interest in 
another puzzling question about the history of life began to germinate. 
Could it be that the origin of animal life was in its own way just as 
difficult a problem as that of the origin of life itself? Though I eventually 
concluded that the Cambrian explosion does, indeed, present a profound 
challenge to contemporary Darwinian theory, it didn’t take me long to 
discover that some scientists believed that the mystery of the Cambrian 
explosion had already been resolved by the discovery of some rather 
enigmatic Precambrian fossils. We turn to those next. 


4 
The Not Missing Fossils? 


The atmosphere in the auditorium of the Sam Noble Science Museum at 
the University of Oklahoma was uncomfortably tense, with a security 
detail of local Norman, Oklahoma, police officers on hand to keep the 
peace—a conspicuous change from the campus security guard who 
might be present at a typical university event. The occasion? A new 
documentary, Darwins Dilemma, that Jonathan Wells, a colleague of 
mine from the Discovery Institute, and I were scheduled to show. The 
film would explore the challenge to Darwin’s theory posed by the 
Cambrian fossil record. 

For weeks before our event in September of 2009, outspoken 
evolutionary biology students and an atheist student group, both egged 
on by militant off-campus bloggers, had threatened to disrupt the 
screening. Members of the biology faculty pledged to come, so that well 
before the official start time a large crowd had gathered. 

The museum and the geology department, not wanting to complicate 
matters in their own minds by watching the film first, decided to launch 
a preemptive first strike by issuing a disclaimer and scheduling an 
official lecture designed to rebut the film. In the disclaimer the museum 
stated that, given its public funding, it had no choice but to rent the 
auditorium to groups irrespective of their “religious beliefs” or 
“scientific literacy.” The disclaimer further noted that the Sam Noble 
Science Museum did “not support unscientific views masquerading as 
science, such as those of the Discovery Institute.” The museum flyer 
also announced the lecture of one of their curators, a paleontologist from 
the university, and mocked the topic of the film—the Cambrian 


“explosion”—with carefully placed scare quotes. The lecture’s start time 
of 5:00 PM. also ensured a confluence afterward of the audience made 
upset by the lecture and the more friendly audience coming to see the 
film. 

Jonathan Wells, a biologist well known for his skepticism about 
contemporary Darwinian theory, attended the pre-film lecture. Ignoring 
a few hostile glares, he listened as the paleontologist from the university 
argued that the Cambrian explosion presented no actual dilemma for 
Darwinian evolution, and as this same paleontologist speculated that if 
Darwin had only known what paleontologists today know of the 
Cambrian fossil record, he (Darwin) would have celebrated it as 
confirmation of his theory. This particular paleontologist also denied that 
novel animal forms emerged suddenly in the Cambrian. Instead, he 
argued that they arose in rudimentary form much earlier in the late 
Precambrian. He noted that paleontologists had discovered in late 
Precambrian sediments fossilized sponges, a type of primitive mollusk, 
and the burrows of worms. 

He also laid particular stress on the significance of a group of 
enigmatic organisms first discovered in the Ediacaran Hills in southern 
Australia dating from about 565 million years ago, in a Precambrian 
period known as the Vendian or Ediacaran. Jonathan and I were well 
aware that most paleontologists do not regard these fossilized organisms 
as plausible ancestors to the Cambrian fauna. But on that evening the 
expert from the university claimed the opposite. He also claimed that 
some obscure Ediacaran organisms (with exotic names such as 
Vernanimalcula, Parvancorina, and Arkarua) represented early 
bilaterians (bilaterally symmetrical animals), arthropods, and 
echinoderms. He insisted that these organisms pushed back the 
explosion of animal life by some 40 million years, establishing a “fuse” 
for the Cambrian explosion in the form of primitive and presumably 
ancestral animal forms for several of the most significant Cambrian 
phyla and body designs. 

An hour before I was to walk over to lead a discussion and answer 
questions about the Cambrian film from what proved to be an intensely 


hostile audience, Jonathan Wells called me with a report on the 
museum’s attempt to refute the film preemptively. The presentation 
claimed to have resolved the Cambrian mystery—Darwin’s dilemma— 
by showing that the ancestral precursors to the major groups of 
Cambrian animals had been found after all. But is this true? 


THE EDIACARAN FAUNA AND VENDIAN RADIATION 


In the previous chapter, we saw that many prominent paleontologists 
have sought to explain the Cambrian explosion as an artifact of our 
incomplete sampling of an incomplete fossil record. The lecture that my 
colleague heard that night in Oklahoma took a very different approach, 
giving the strong impression that the Precambrian fossil record actually 
does preserve the ancestral forms of the Cambrian animals and that the 
Ediacaran fauna, in particular, provide several striking examples of such 
forms. 

In public presentations about the Cambrian explosion, I’ve often 
encountered this claim, though usually in the form of an unfocused 
question: “What about the Ediacaran?” Nevertheless, in writing about 
the Cambrian, I take care not to attribute the idea that the Ediacaran 
fauna represent Cambrian ancestors to leading Ediacaran or Cambrian 
experts, lest I critique a straw man. Most paleontologists doubt that well- 
known Ediacaran forms represent ancestors of the Cambrian animals and 
few think the late Precambrian fossil record as a whole makes the 
Cambrian explosion appreciably less explosive. The claim is important 
to address, however, since it persists as a kind of paleontological urban 
legend, one that even occasionally finds its way into the mouths of 
paleontologists. 

The Ediacaran fauna derive their name from their most notable 
discovery site, the Ediacaran Hills in the outback of southeastern 
Australia. These fauna date from late Precambrian time, a period that the 
International Union of Geological Sciences has recently renamed the 
“Ediacaran period.”! Since geologists used to call the last period of 
Precambrian time the “Vendian period,” paleontologists also refer to the 


Ediacaran fauna as the Vendian fauna or biota (see Fig. 1.6). 
Paleontologists have made additional discoveries of Edicaran- or 
Vendian-era creatures in England, Newfoundland, the White Sea in 
northwestern Russia, and the Namibian desert in southern Africa, 
suggesting a near worldwide distribution. Although these fossils were 
originally dated to between 700 and 640 million years old, volcanic ash 
beds both below and above the Namibian site have recently provided 
more accurate radiometric dates. These studies fix the date for the first 
appearance of the Ediacaran fauna at about 570—565 million years ago, 
and the last appearance at the Cambrian boundary about 543 million 
years ago, or about 13 million years before the onset of the Cambrian 
explosion itself.” 

The late Precambrian-era sediments around the world have yielded 
four main types of fossils, all of which are dated between about 570 and 
543 million years ago. The first group consists of the Precambrian 
sponges mentioned in the previous chapter. These animals first arose 
about 570 to 565 million years ago. 
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FIGURE 4.1 

Examples of enigmatic Ediacaran fossils: Dickinsonia, Spriggina, and 
Charnia. Fossil photos in Figures 4.1a and 4.1b courtesy Peterson, K. J., 
Cotton, J. A., Gehling, J. G., and Pisani, D., “The Ediacaran Emergence 
of Bilaterians: Congruence Between the Genetic and the Geological 
Fossil Records,” Philosophical Transactions of the Royal Society B, 
2008, 363 (1496): 1435—43, Figure 2, by permission of the Royal 
Society. 


The second is the distinctive group of fossils from the Ediacaran 
Hills. The creatures fossilized there include such well-known forms as 
the flat, air mattress-like body of Dickinsonia; the enigmatic Spriggina, 
with its elongated and segmented body and possible head shield; and the 
frond-like Charnia (see Fig. 4.1). These organisms were at least mostly 
soft-bodied and are large enough to identify with the naked eye. 

The third group includes what are called trace fossils, the possible 
remains of animal activity such as tracks, burrows, and fecal pellets. 
Some paleontologists have attributed these trace fossils to ancient 
worms. 

The fourth group is the fossils of what may be primitive mollusks, a 
possibility that recetved support from a recent discovery in the cliffs 
along the White Sea in northwestern Russia. There, Russian scientists 
have discovered thirty-five distinctive specimens of a possible mollusk 
called Kimberella, probably a simple animal form. These new White Sea 
specimens, dated to 550 million years ago, suggest that Kimberella “had 
a strong [though not hard], limpet-like shell, crept along the sea floor, 
and resembled a mollusk.” Paleontologist Douglas Erwin, of the 
Smithsonian Institution, has commented: “It’s the first animal that you 
can convincingly demonstrate is more complicated than a flatworm.”* 
Additionally, seafloor tracks from Precambrian sediments in both 
Canada and Australia have been attributed to mollusks, since the tracks 
resemble what might have been left by a row of small teeth on the 
tongue-like ribbon of some mollusks as they scraped food particles off 
the seafloor. In this case, Kimberella may well have been the track 


maker.” The authors of the original descriptive paper in Nature, Mikhail 
Fedonkin, from the Russian Academy of Sciences, and Benjamin 
Waggoner, then at the University of California at Berkeley, conclude as 
much and suggest that such creatures “began to diversify before the 
beginning of the Cambrian.” Paleontologists, however, are still 
weighing the evidence.’ 


THE SIGNIFICANCE OF THE EDIACARAN 


So do either the remains of the specific organisms from the Ediacaran 
Hills or the Ediacaran or Vendian biota as a whole solve the problem of 
the Cambrian explosion? Do these exotic forms represent a kind of fuse 
to the Cambrian explosion that eliminates the need to explain the rapid 
emergence of novel body plans and forms of animal life? There are 
many good reasons to doubt this idea. 

First, with the exception of sponges and the possible exception of 
Kimberella, the body plans of visibly fossilized organisms (as opposed 
to trace fossils) bear no clear relationship to any of the organisms that 
appear in the Cambrian explosion (or thereafter). The most noted 
Ediacaran organisms such as Dickinsonia, Spriggina, and Charnia do 
not have an obvious head, a mouth, bilateral symmetry (see below), a 
gut, or sense organs such as eyes. Some paleontologists question 
whether these organisms even belong in the animal kingdom. 

Dickinsonia, for example, has been interpreted by University of 
Oregon paleontologist Gregory Retallack as having “fungal-lichen” 
affinities, since its mode of fossil preservation “is comparable not with 
that of soft-bodied jellyfish, worms, and cnidarians, but with the fossil 
record of fungi and lichens.” Dickinsonia’s taxonomic position, 
Retallack notes, has long been an unsolved puzzle. “Biological affinities 
of Dickinsonia remain problematic,” he writes, since it has been 
“variously considered a polychaete, turbellarian or annelid worm, 
jellyfish, polyp, xenophyophoran protist, lichen or mushroom.” 

Similar disputes have characterized attempts to classify Spriggina. In 
1976, Martin Glaessner, the first paleontologist to study the Ediacaran in 


detail, described Spriggina as a possible annelid polychaete worm based 
largely upon its segmented body. Nevertheless, Simon Conway Morris 
later rejected that hypothesis because Spriggina shows no evidence of 
the distinguishing “chaetes,” leg-like bristled protrusions that polychaete 
worms possess. Glaessner himself later repudiated his original 
hypothesis that Spriggina was ancestral to polychaetes, noting that 
Spriggina “cannot be considered as a primitive polychaete, having none 
of the possible ancestral characters indicated . . . by specialists on the 
systematics and evolution of this group.” ° 

In 1981, paleontologist Sven Jorgen Birket-Smith produced a 
reconstruction of a Spriggina fossil showing that it possessed a head and 
legs similar to those of trilobites, though examinations of subsequent 
Spriggina specimens have shown no evidence of it possessing limbs of 
any kind.!! In 1984, Glaessner weighed in on this discussion as well. He 
argued that “Spriggina shows no specific characters of the arthropods, 
particularly of the trilobites.”!* He also noted that the body segmentation 
of Spriggina, and “its known appendages are at the level of polychaete 
annelids”!> (although, as noted, by this time he had rejected Spriggina as 
a possible polychaete ancestor). Instead, he proposed that Spriggina 
represented a side branch on the animal tree of life—one that resulted, 
“metaphorically” perhaps, in “an unsuccessful attempt to make an 
arthropod.” 

In a presentation to the Geological Society of America in 2003, 
geologist Mark McMenamin revived the idea that Spriggina might 
represent a trilobite ancestor. He argued that several features present in 
Spriggina fossils are comparable to those in trilobites such as “the 
presence of genal spines” and an effaced head or “cephalic region.”!4 
Nevertheless, many Ediacaran experts, including McMenamin, have also 
noted that Spriggina specimens show no evidence of eyes, limbs, 
mouths, or anuses, most of which are known from fossil trilobites. !> 
Other paleontologists remain skeptical about whether Spriggina does in 
fact exhibit genal spines, noting that good specimens seem to show 
relatively smooth edges with no protruding spines.'° In addition, 
analysis of the best specimens of Spriggina shows that it does not 


exhibit bilateral symmetry, undermining earlier attempts to classify it as 
a bilaterian animal, and by implication an arthropod.'’ Instead, 
Spriggina exhibits something called “glide symmetry” in which the body 
segments on either side of its midline are offset rather than aligned.!® As 
geologist Loren Babcock of Ohio State University notes, “The zipper- 
like body plans of some Ediacaran (Proterozoic) animals such as 
Dickinsonia and Spriggina involve right and left halves that are not 
perfect mirror images of each other.”!? The lack of such symmetry, a 
distinctive feature of all bilaterian animals, and the absence in Spriggina 
specimens of many other distinguishing features of trilobites, has left the 
classification of this enigmatic organism uncertain. 

Paleontologists James Valentine, Douglas Erwin, and David 
Jablonski distill the confusing welter of conflicting views about the 
Ediacaran fossils: “Although the soft-bodied fossils that appear about 
565 million years ago are animal-like, their classifications are hotly 
debated. In just the past few years these fossils have been viewed as 
protozoans; as lichens; as close relatives of the cnidarians; as a sister 
group to cnidarians plus all other animals; as representatives of more 
advanced, extinct phyla; and as representatives of a new kingdom 
entirely separate from the animals.”2° What’s more, Valentine, Erwin, 
and Jablonski note that those paleontologists who do regard the 
Ediacaran fauna as animals rarely classify them the same way, 
underscoring their lack of clear affinities to any known animal groups. 
As they note, “Still other specialists have parceled the fauna out among 
living phyla, with some assigned to the Cnidaria and others to the 
flatworms, annelids, arthropods and echinoderms.”?! The uncertain 
standing of these fossilized forms is partly due to their early extinction, 
but it also stems from an absence of defining characteristics shared with 
known groups. They conclude: “This confusing state of affairs arose 
because these body fossils do not tend to share definitive anatomical 
details with modern groups, and thus the assignments must be based on 
vague similarities of overall shape and form, a method that has 
frequently proved misleading in other cases.” 


Other leading paleontologists also doubt that the Cambrian animals 
descended from these Ediacaran forms. In a phylogenetic diagram 
showing the evolutionary relationship of Precambrian and Cambrian 
fossils, Oxford biologists Alan Cooper and Richard Fortey depict the 
Ediacaran fauna as lying on a line of descent separate from the 
Cambrian animals rather than being ancestral to them.?? In another 
paper, Fortey asserts that the beginning of the Cambrian “saw the sudden 
appearance in the fossil record of almost all the main types of animals 
(phyla) that still dominate the biota today.” He concedes that there are a 
variety of fossils in older strata, but insist that “they are either very small 
(such as bacteria and algae) or their relationships to the living fauna are 
highly contentious, as is the case with the famous soft-bodied fossils 
from the late Precambrian Pound Quartzite, Ediacara, South 
Australia.””4 

Similarly, paleontologist Andrew Knoll and biologist Sean B. Carroll 
have argued: “It is genuinely difficult to map the characters of Ediacaran 
fossils onto the body plans of living invertebrates.” Although many 
paleontologists initially showed interest in the possibility that the 
Cambrian animal forms might have evolved from the Ediacaran 
organisms, paleontologist Peter Ward explains that “later study cast 
doubt on the affinity between these ancient remains preserved in 
sandstones [the Australian Ediacaran] and living creatures of today” 
(that is, animals representing phyla that first arose in the Cambrian).*° 
As Nature recently noted, if the Ediacaran fauna “were animals, they 
bore little or no resemblance to any other creatures, either fossil or 
extant.”?/ 

This absence of clear affinities has led an increasing number of 
paleontologists to reject ancestor-descendant relationships between all 
but (at most) a few of the Ediacaran and Cambrian fauna. Nevertheless, 
some have suggested that trace fossils may establish a link. In an 
authoritative 2011 paper in the journal Science, Douglas Erwin and 
colleagues described the discovery of Ediacaran trace fossils consisting 
of surface tracks, burrows, fecal pellets, and feeding trails, which, they 
argue, though small, could only have been made by animals such as 


worms with a relatively high degree of complexity.” On the basis of 
these findings, Erwin and other paleontologists have argued that these 
trace fossils suggest the existence of organisms with a head and tail, 
nervous systems, a muscular body wall allowing creeping or burrowing, 
and a gut with mouth and anus.”? Other paleontologists suggest that 
these characteristics may indicate the presence of a Precambrian mollusk 
or a worm phylum.*? 

Graham Budd, a British paleontologist who works at Uppsala 
University in Sweden, and others, have disputed these associations. 
Budd and geologist colleague Sören Jensen argue that many alleged 
trace fossils actually show evidence of inorganic origin: “There are 
numerous reports of older trace fossils, but most can be immediately 
shown to represent either inorganic sedimentary structures or metaphytes 
[land plants], or alternatively they have been misdated.”?! Still others 
have suggested that surface tracks and trails could have been left by 
mobile single-celled organisms, including a known form of a giant deep- 
sea protist that leaves bilaterian-like impressions. As one paper explains, 
“Some such traces date back to 1.5 billion to 1.8 billion years ago, which 
outdates even the boldest claims of the time of origin of animal multi- 
cellularity and forces researchers to contemplate the possibility of an 
inorganic or bacterial origin.”?? 

Even the most favorable interpretations of these trace fossils suggest 
that they indicate the presence of no more than two animal body plans 
(of largely unknown characteristics). Thus, the Ediacaran record falls far 
short of establishing the existence of the wide variety of transitional 
intermediates that a Darwinian view of life’s history requires. The 
Cambrian explosion attests to the first appearance of organisms 
representing at least twenty phyla and many more subphyla and classes, 
each manifesting distinctive body plans. In a best case, the Ediacaran 
forms represent possible ancestors for, at most, four distinct Cambrian 
body plans, even counting those documented only by trace fossils. This 
leaves the vast majority of the Cambrian phyla with no apparent 
ancestors in the Precambrian rocks (1.e., at least nineteen of the twenty- 


three phyla present in the Cambrian have no representative in 
Precambrian strata).>° 

Third, even if representatives of four animal phyla were present in the 
Ediacaran period, it does not follow that these forms were necessarily 
transitional or intermediate to the Cambrian animals. The Precambrian 
sponges (phylum Porifera), for example, were quite similar to their 
Cambrian brethren, thus demonstrating, not a gradual transformation 
from a simpler precursor or the presence of an ancestor common to 
many forms, but quite possibly only an earlier first appearance of a 
known Cambrian form. The same may be true of whatever kind of worm 
may be attested by Precambrian tracks and burrows. 

Moreover, even assuming, as some evolutionary biologists do,** that 
later Cambrian animals had a sponge-like Precambrian ancestor, the gap 
in complexity as measured by the number of cell types alone, to say 
nothing of the specific anatomical structures and distinct modes of body 
plan organization that are present in later animals but not in sponges, 
leaves a massive discontinuity in the fossil record that requires 
explanation (much like the morphological gap between Spriggina and 
actual arthropods). 


AN EDIACARAN MINI-EXPLOSION 


The Ediacaran fossils themselves provide evidence of a puzzling leap in 
biological complexity, though not one nearly great enough (or of the 
right kind) to account for the Cambrian explosion. Before organisms like 
Kimberella, Dickinsonia, and sponges appeared, the only living forms 
documented in the fossil record for over 3 billion years were single- 
celled organisms and colonial algae. Producing sponges, worms, and 
mollusks from single-celled organisms is a little like transforming a 
spinning top into a bicycle. The bicycle isn’t remotely as complex as the 
automobile sitting beside it, but it represents an enormous leap in 
technological sophistication over the spinning top. Likewise, although 
the humble Ediacaran biota look simple beside most of the Cambrian 


animals, they represent an enormous leap in functional complexity over 
the single-celled organisms and colonial algae that preceded them. 

Thus, the Ediacaran biota attest to a separate sudden increase in 
biological complexity within a short window of geological time (about 
15 million years), following roughly 3 billion years in which only 
single-celled organisms inhabited the earth.*> This leap in complexity, in 
a relatively short span of geological time, may well exceed the 
explanatory resources of natural selection working on random mutations. 
We will return to that question in Part Two. 

The Ediacaran fossils therefore do not solve the problem of the 
sudden increase in biological form and complexity during the Cambrian. 
Instead, they represent an earlier, if less dramatic, manifestation of the 
same kind of problem. To biology’s “big bang,”® the Ediacaran biota 
add a significant “pow.” As paleobiologist Kevin Peterson, of Dartmouth 
College, and his colleagues note, these fauna represent “an apparent 
quantum leap in ecological complexity as compared with the ‘boring 
billions’ [ of years] that characterize Earth before the Ediacaran,” even if 
these organisms are “still relatively simple when compared with the 
Cambrian,” which they characterize as another “quantum leap in 
organismal and ecological complexity.”?7 

Many paleontologists now refer to the Ediacaran radiation as an 
explosion in its own right. This Precambrian “pow” makes the 
problem of fossil discontinuity only more acute, since credible 
intermediates leading to the Ediacaran layers are completely nonexistent 
in the even more sparsely populated strata beneath them. 

Finally, even if one regards the appearance of the Ediacaran fossils as 
evidence of a “fuse” leading to the Cambrian explosion as some have 
proposed,?? the total time encompassed by the Ediacaran and Cambrian 
radiations still remains exceedingly brief relative to the expectations and 
requirements of a modern neo-Darwinian view of the history of life. As I 
will explain in more detail in Chapter 8, neo-Darwinism is the modern 
version of Darwin’s theory that invokes random genetic changes called 
mutations as the source of much of the new variation upon which natural 
selection acts. Like classical Darwinism, the neo-Darwinian mechanism 


requires great stretches of time to produce novel biological form and 
structure. Yet, current studies in geochronology suggest that only 40 to 
50 million years elapsed between the beginning of the Ediacaran 
radiation (570-565 million years ago) and the end of the Cambrian 
explosion (525-520 million years ago).*° To anyone unfamiliar with the 
equations of population genetics by which neo-Darwinian evolutionary 
biologists estimate how much morphological change is likely to occur in 
a given period of time, 40 to 50 million years may seem like an eternity. 
But empirically derived estimates of the rate at which mutations 
accumulate imply that 40 to 50 million years does not constitute 
anything like enough time to build the necessary anatomical novelties 
that arise in the Cambrian and Ediacaran periods. I will describe this 
problem in more detail in Chapter 12. 

Until recently, radiometric studies had estimated the duration of the 
Cambrian radiation itself at 40 million years, a period of time so brief, 
geologically speaking, that paleontologists had already dubbed it an 
“explosion.” The relative suddenness of the Cambrian explosion, even 
on the earlier measure of its duration, had already raised serious 
questions about the adequacy of the neo-Darwinian mechanism; 
consequently, it had also raised questions about whether a Darwinian 
understanding of the history of life could be reconciled with the 
Cambrian and Precambrian fossil record. Thus, treating the Ediacaran 
and the Cambrian radiations as one continuous evolutionary event (itself 
an unrealistically generous assumption) only returns the problem to its 
earlier (pre-zircon redating) status. 

For all these reasons, the late Precambrian fossils have not solved, 
but instead have deepened, the mystery of the origin of animal form. 
And few leading Cambrian paleontologists, of whom I was aware on 
that September evening in 2009 while preparing to answer questions at 
the University of Oklahoma, thought otherwise. 


EDIACARAN EXOTICA 


So what about the claim that certain exotic Ediacaran fossils are 
plausible ancestors to the Cambrian animal forms, even if better-known 
Ediacaran forms such as Dickinsonia, Charnia, and Spriggina are 
probably not? Did these exotic forms solve the mystery of the Cambrian 
explosion? 

Only a few years before my visit to the University of Oklahoma, I 
had written a scientific review article with research help from several 
colleagues, including a paleontologist and a marine biologist.*! (The 
latter was Paul Chien, who helped discover the Precambrian sponge 
embryos discussed in the previous chapter.) In our review article, I 
explained many of the problems with treating the Ediacaran as 
transitional intermediates discussed above. In the process of doing the 
research for that article, my colleagues and I encountered few 
paleontologists who thought that Parvancorina, Arkarua (see Fig. 4.2), 
or Vernanimalcula represented definitive ancestors of the Cambrian 
bilaterians, arthropods, or echinoderms. Could we have missed 
something? 

In fact, leading Cambrian authorities have dismissed associations 
between these odd fossil forms and the Cambrian animals. Nevertheless, 
in his talk before the showing of our film, the local professor from the 
University of Oklahoma asserted that the rather indistinct fossil form 
found in the Ediacaran Hills called Parvancorina represented a plausible 
ancestor of the arthropods. Some have described Parvancorina as a 
shield-shaped fossil form with a raised anchor-shaped ridge impressed 
atop it, bearing a superficial resemblance in its shape to that of a trilobite 
—thus, the claim that it might have represented an early arthropod. Yet 
leading Cambrian paleontologists dispute this association. Cambrian 
expert James Valentine has argued that Parvancorina is not convincing 
as an arthropod ancestor, and for good reason. Parvancorina fossils lack 
a head, jointed limbs, and compound eyes, all distinctive features of 
arthropods. Thus, Valentine noted that Parvancorina fossils “have not 
been shown to share derived features” with arthropods.” 





FIGURE 4.2 

Figure 4.2a (left): Photograph of Arkarua fossil, courtesy Taylor & 
Francis, Ltd. Figure 4.2b (right): Photograph of Parvancorina fossil, 
courtesy Peterson, K. J., Cotton, J. A., Gehling, J. G., and Pisani, D., 
“The Ediacaran Emergence of Bilaterians: Congruence between the 
Genetic and the Geological Fossil Records,” Philosophical Transactions 
of the Royal Society B, 2008, 363 (1496): 1435-43, Figure 2, by 
permission of the Royal Society. 


Valentine makes much the same point about the small disc-like 
imprint called Arkarua, one of the other Ediacaran forms cited by the 
University of Oklahoma professor that night at the Sam Noble Museum. 
Valentine points out that it too lacks many distinctive features of the 
animal phylum to which it is typically assigned. Indeed, those who 
propose Arkarua as an ancestor of the Cambrian animals usually claim 
that it represents an early echinoderm (as the professor in Oklahoma 
did). Echinoderms include starfish, sand dollars, and other animals with 
five-fold symmetry extending from a central body cavity.*7 Some have 
perceived five tiny segmented divisions within the circular impressions 
left by Arkarua, making them seem roughly similar to some modern 
echinoderms. But that similarity has proven superficial at best. Other 


paleontologists observe that Arkarua lacks a stereom, or water vascular 
system, a definitive diagnostic feature of echinoderms; thus, its 
“echinoderm-specific features are not readily visible.”** Valentine has 
argued that, absent such telltale features, the relationship of Arkarua to 
echinoderms “remains uncertain.””*> 

In the case of Vernanimalcula, the story is more complicated but 
equally problematic. Vernanimalcula is the name that Chinese 
paleontologists gave to an imprint in phosphorite sediment found in the 
Doushantuo Formation in 2004. They found the structure in 580- to 600- 
million-year-old rocks, making the impression even older than the 
Ediacaran strata. The paleontologist David Bottjer of the University of 
Southern California, and some Chinese paleontologists (at least, 
initially), speculated that the Vernanimalcula imprint might be the 
remains of an early bilaterian.*° 

Recall that bilaterians are animals whose parts found on one side of 
the body midline are also found in mirror image on the other (as opposed 
to, say, a radially symmetric animal*’). Figure 4.3 shows a picture of the 
structure of Vernanimalcula first found in the Doushantuo Phosphorite 
formation. Some paleontologists think that Vernanimalcula exhibits such 
bilateral symmetry and thus might be ancestral to the bilaterian animals 
that later first appeared in the Cambrian period. 

But problems have emerged with this argument. First, the form of the 
Vernanimalcula does not resemble any specific bilaterian animal. In 
addition, recent scientific analyses of these remains have questioned 
whether this imprint preserves the remains of animals and, therefore, 
bilaterians at all. For example, in 2004, Stefan Bengtson and Graham 
Budd, two paleontologists and Cambrian experts, published a detailed 
chemical and microscopic analysis of these fossils in the journal 
Science.*® They concluded that the structures preserved in phosphorite 
rocks had undergone significant alteration by so-called diagenesis and 
taphonomic processes. Diagenesis refers mainly to processes of 
chemical alteration that occur after sediments are deposited and before 
sedimentary rocks are fully hardened, or “lithified.” Taphonomic 


processes are those that alter once living organisms after burial and 
preservation in sediments. 





FIGURE 4.3 

Photograph of Vernanimalcula fossil. Courtesy American Association for 
the Advancement of Science, from Chen, J.-Y., Bottjer, D. J., Oliveri, P., 
Dornbos, S. Q., Gao, F., Ruffins, S., “Small Bilaterian Fossils from 40 to 
55 Million Years Before the Cambrian,” Science, 305 (July 9, 2004): 
218-22, Figure 1b. Reprinted with permission from AAAS. 


Based on their microscopic analysis, Bengtson and Budd rejected the 
hypothesis that these structures preserved the remains of an animal form. 
Instead, they argued that the phosphorite imprint exhibited distinctive 
features of the chemically altered remains of one-celled microfossils— 
microfossils that had been encrusted with layers of chemical residue 
from various diagenetic processes.*” 

More recently, in 2012, Bengtson and three other colleagues 
published another paper sharply critical of the view that Vernanimalcula 


represents an ancestor of the bilaterian animals—or even an animal. 
They show that the “structures key to animal identity are effects of 
mineralization that do not represent biological tissues.” For this reason 
they conclude: “There is no evidential basis for interpreting 
Vernanimalcula as an animal, let alone a bilaterian.”>” 

Though the paper was titled “A Merciful Death for the ‘Earliest 
Bilaterian,’ Vernanimalcula,” the authors were anything but merciful in 
wielding their arguments. Their article upbraided David J. Bottjer, the 
main paleontologist who has promoted the interpretation of 
Vernanimalcula as a bilaterian ancestor, for seeing what he wanted to see 
and disregarding the clear evidence of nonbiological mineralization. In a 
2005 Scientific American article, Bottjer interpreted Vernanimalcula as 
the “oldest fossil animal with a bilaterian body plan yet discovered.” In 
that article, Bottjer claimed that Vernanimalcula confirmed the 
“suspicion that complex animals have a much deeper root in time” and 
“that the Cambrian was less of an explosion and more of a flowering of 
animal life.”°! After unequivocally rejecting Bottjer’s interpretation on 
the basis of their geochemical analysis, Bengtson and his coauthors 
rebuked Bottjer in rather personal terms: 


It is likely that the fossils referred to [as ] Vernanimalcula were 
interpreted as bilaterians because this was . . . the explicit quarry 
of its authors. If you know from the beginning not only what you 
are looking for, but what you are going to find, you will find it, 
whether or not it exists. As Richard Feynman (197 4) famously 
remarked: “The first principle is that you must not fool yourself— 
and you are the easiest person to fool.” ... Once you have fooled 
yourself you will fool other scientists.°7 


Bengtson and his colleagues insist that however much a few 
paleontologists such as Bottjer might have wanted to “heap” 
“evolutionary significance” on Vernanimalcula in order to relieve their 
cognitive dissonance about the Cambrian explosion, the evidence does 
not bear the weight of interpretation that had been placed upon it. Thus, 


they conclude Vernanimalcula should be “laid to a merciful rest,” since 
the interpretation of it had “taken on a life of its own, a life it never 
had.”>? 


A DEEPER PROBLEM 


Though back in 2009, Jonathan Wells and I didn’t know about the most 
recent critical analysis of Vernanimalcula’s pretentions, we did know 
that many leading paleontologists had rejected attempts to identify 
Vernanimalcula as an animal form. Thus, during the question-and- 
answer period following the film, Jonathan Wells explained why these 
and other obscure and enigmatic Precambrian fossils (or imprints) failed 
to qualify as convincing precursors to any of the Cambrian animals, 
citing the work of leading authorities in paleontology. In each case, he 
noted that similarities between the Ediacaran forms and later Cambrian 
animals had proven superficial, because the Ediacaran forms lacked 
many key diagnostic features of specific Cambrian phyla. 

At the same time, as I later reflected on the lecture, I recognized a 
deeper problem with attempts to resolve the mystery of the Cambrian 
explosion by pointing to a few Precambrian fossils. Many defenders of 
the Darwinian picture of the history of life seemed to assume that the 
discovery of any alleged Precambrian animal forms, however 
implausible as ancestors of specific Cambrian animals or however 
sparsely distributed in the vast sequences of Precambrian strata, would 
solve the mystery of the Cambrian explosion, especially if these forms 
exemplified some abstractly perceived commonality such as bilateral 
symmetry. 

To see what’s wrong with this way of thinking, imagine an ambitious 
distance swimmer claiming that it would be possible to swim between 
California and Hawaii over a period of many months or years because of 
the small islands that provide way stations where he could eat, rest, and 
overnight at each stage along his marathon journey. But instead of 
showing that an archipelago dotting the route between California and 
Hawaii at reasonably accessible intervals actually exists, he points to a 


couple of barren atolls in the South Pacific far from the most likely 
course to Hawaii. Clearly, in that case, the claims of our intrepid 
hypothetical swimmer would not be credible. In a similar way, the 
claims of those who assert that a few isolated and anatomically 
enigmatic forms of life in the Precambrian solve the problem of the 
Cambrian explosion also lack credibility. 

To appreciate another aspect of this problem, let’s revisit the claims 
about Vernanimalcula as a possible ancestor of all the bilaterian phyla. 
On the one hand, for such a form to qualify as the ancestor common to a 
large number of specific phyla (such as the many bilaterian phyla that 
arise in the Cambrian), it must exhibit the basic bilaterian characteristics 
such as bilateral symmetry and what is called “triploblasty,” the presence 
of three distinct tissue layers (endoderm, mesoderm, and ectoderm). At 
the same time, a viable candidate for the role of common ancestor 
cannot by definition manifest any of the differentiating characteristics 
that distinguish the individual Cambrian phyla and their respective body 
plans from each other. For example, any bilaterian that manifests the 
characteristic exoskeleton of, say, an arthropod cannot also qualify as a 
plausible ancestor of a chordate, because chordates have internal 
skeletons or notochords. The logic of these distinct body designs 
precludes sharing both anatomical characteristics. For this reason, any 
hypothetical bilaterian common ancestor could only have existed as a 
kind of lowest anatomical common denominator, or what evolutionary 
biologists call a “ground plan,” having only those few features that are 
common to all of the animal forms that allegedly evolved from it. 

But this creates a dilemma. If a fossilized form is simple enough to 
qualify as the common ancestor of later highly differentiated bilaterian 
phyla, then it will necessarily lack most of the important distinguishing 
anatomical features of those specific phyla. That means that all the 
interesting anatomical novelties that differentiate one phylum from 
another must arise along the separate lineages branching out from the 
alleged common ancestor well after its origin in the fossil record. Heads, 
jointed limbs, compound eyes, guts, anuses, antennae, notochords, 
stereoms, lophophores (a tentacled feeding organ), and numerous other 


distinguishing characteristics of many different animals must come later 
on many distinct lines of descent. Yet the gradual evolutionary origin of 
these characteristics is not documented in the Precambrian fossil record. 
These characteristics do not appear until they arise suddenly in the 
Cambrian explosion. 

For this reason, indistinct fossils such as Vernanimalcula—even if we 
take them as representing a common ancestor of many bilaterians— 
document little of the Darwinian story of the history of animal life. 
Hugely significant gaps in the fossil record would still remain, because 
the Precambrian fossil record simply does not document the gradual 
emergence of the crucial distinguishing characteristics of the Cambrian 
animals. The important anatomical novelties that define the individual 
Cambrian phyla as well as their first clear representatives arise as 
suddenly as ever. 

To say that a form such as Vernanimalcula, or any of the other 
relatively indistinct Ediacaran forms, solves the problem of the missing 
Precambrian fossil record would be a bit like saying that a metal 
cylinder demonstrates all the steps involved in the construction of a 
toaster, automobile, submarine, or jet airplane simply because all these 
technological objects utilize “metal enclosures.” It’s true that each of 
these complex systems uses metal enclosures, but the presence of an 
enclosed metal surface is only a necessary, and not nearly a sufficient, 
condition of the origin of these various technological systems. Similarly, 
finding a simple but otherwise unadorned bilaterally symmetric form of 
life would hardly solve the problem of fossil discontinuity, because it 
would not by itself document the emergence of the unique characteristics 
of the individual bilaterian animals. 

This paradox is well known to paleontologists who work on the 
Cambrian radiation. Charles Marshall and James Valentine, for instance, 
describe the difficulty of attempting to characterize an “undiagnostic” 
group, by which they mean a possible ancestral “stem” group that lacks 
the specialized characteristics of its presumptive evolutionary progeny. 
They write: 


When trying to unravel the origins of the animal phyla. . . the 
hardest to examine is the phase between the actual cladogenic 
origin of a phylum and the time that it acquired its first phylum- 
specific characteristic(s). Even if we have fossils from this phase 
in a phylum’ history, we will not be able to prove their kinships at 
the level of phyla.*4 


Thus, even if Vernanimalcula, or some other fossil form, were simple 
enough and animal-like enough to qualify as a so-called ur (or original) 
form of animal life, it would paradoxically, for just that reason, be 
incapable of establishing itself as an unequivocal common ancestor of 
some specific Cambrian phylum. 

And there’s no relief in the other direction. If an alleged ancestral 
form manifests the distinguishing features of one of the specific 
Cambrian phyla—if, for example, Vernanimalcula or some other 
isolated form presented a convincing set of distinctively arthropod or 
chordate or echinoderm characteristics, then the very presence of those 
features would necessarily preclude the possibility of that specific 
animal form representing the common ancestor of all other Cambrian 
forms. The more an animal form manifests the characteristics of one 
phylum or group within the phylum, the less plausible it becomes as the 
ancestor of all the other animal phyla. 

And that is the dilemma in a nutshell. Highly differentiated and 
complex Precambrian forms by themselves could not have been 
ancestors common to all the Cambrian phyla; whereas undifferentiated 
forms simple enough to have been ancestral to all the Cambrian phyla 
leave no evidence, by themselves, of the gradual emergence of the 
complex anatomical novelties that define the Cambrian animals. Either 
way—whether the few alleged Precambrian ancestors are viewed as 
simple and relatively undifferentiated or complex and highly 
differentiated—the fossil record, given its otherwise pervasive pattern of 
discontinuity, does not establish the gradual evolution of numerous 
anatomical and morphological novelties. Instead, only a true series of 
transitional intermediates in which the fossil record documents both the 


existence of an original animal form and the gradual appearance of the 
key distinguishing anatomical features and novelties (and the Cambrian 
animals themselves) would remedy this deficiency. And yet that is 
precisely what the Precambrian fossil record has failed to document. 

As Graham Budd and Soren Jensen state, “The known 
[ Precambrian/Cambrian] fossil record has not been misunderstood, and 
there are no convincing bilaterian candidates known from the fossil 
record until just before the beginning of the Cambrian (c. 543 Ma), even 
though there are plentiful sediments older than this that should reveal 
them.”>> Thus they conclude, “The expected Darwinian pattern of a deep 
fossil history of the bilaterians, potentially showing their gradual 
development, stretching hundreds of millions of years into the 
Precambrian, has singularly failed to materialize.”>° 


DILEMMA ON DISPLAY 


During the question-and-answer session that followed the screening of 
Darwins Dilemma, none of the University of Oklahoma Ph.D. students 
or science faculty who attended the museum-sponsored lecture 
challenged my colleague Jonathan Wells when he explained why leading 
paleontologists do not think the exotic Precambrian forms cited in the 
lecture were ancestors of Cambrian forms. This nonreaction seemed a 
little odd given the stress the museum’s own expert had laid on these 
claims, and given that he had made these claims rather emphatically in 
the same building to many of the same people just three hours earlier. 

On our flight out of town the next day, Jonathan Wells told me 
something that cast our experience there in an even odder light. He’d 
had a chance to walk around the Sam Noble Science Museum after the 
lecture and before our event. He discovered that the museum has a 
display that vividly illustrates the severity of what we called Darwin’s 
dilemma. Wells recorded some of his observations on his return to 
Seattle. One part of the account of what he saw while touring the 
museum’s own display about the Cambrian explosion is worth quoting at 
length: 


[The display ] seemed factually accurate for the most part, 
emphasizing (among other things) that many of the Cambrian 
explosion fossils were soft-bodied—which puts the lie to the 
common explanation that their precursors are absent from the 
fossil record because they lacked hard parts. The exhibit also 
made it clear that the Ediacaran fossils went extinct at the end of 
the pre-Cambrian, so (with a few possible exceptions) they could 
not have been ancestral to the Cambrian phyla. One particular 
panel in the exhibit caught my attention. It showed over a dozen of 
the Cambrian phyla at the top of a branching tree with a single 
trunk, but none of the branch points corresponded to a real living 
thing. Instead, the branch points were artificial technical 
categories such as “Ecdysozoa,” “Lophotrochozoa, ” 
“Deuterostomia,” and “Bilateria.” The artificiality of the branch 
points emphasized that the branching-tree pattern imposed on the 
fossil evidence was itself an artificial construct. 


So, after all the controversy, it turned out that the museum that had 
sponsored the lecture denying Darwin’s Cambrian dilemma itself has an 
excellent display indicating that the expected ancestral forms of the 
Cambrian animals—the very ones that Darwin hoped to find a hundred 
and fifty years ago—are still missing from the Precambrian fossil record. 
But then why would the museum sponsor the presentation that it did? 
It’s hard to say, I suppose, but I’ve seen this dynamic before in 
discussions about Darwinian evolution. Evolutionary biologists will 
acknowledge problems to each other in scientific settings that they will 
deny or minimize in public, lest they aid and abet the dread 
“creationists” and others they see as advancing the cause of unreason. 
Perhaps just our presence on campus raising questions about 
contemporary Darwinism made them feel defensive on behalf of 
“science.” It’s an understandable, if ironic, human reaction, of course, 
but one that in the end deprives the public of access to what scientists 
actually know. It also perpetuates the impression of evolutionary biology 
as a science that has settled all the important questions at just the time 


when many new and exciting questions—about the origin of animal 
form, for example—are coming to the fore. 


5 
The Genes Tell the Story? 


Reconstructing the history of life has a lot in common with detective 
work. Neither detectives nor evolutionary biologists can directly 
observe the events in the past that interest them most. Detectives 
typically did not see the crime occur. Evolutionary biologists did not 
witness the origin of animals or other groups of organisms. Yet this 
limitation doesn’t mean either group of investigators lacks the evidence 
to determine with some confidence what happened. Detectives and 
evolutionary biologists as well as many other historical scientists— 
paleontologists, geologists, archeologists, cosmologists, and forensic 
scientists—do this regularly, based on careful inference from the clues 
or evidence left behind. 

Many evolutionary biologists have commented on the forensic nature 
of their work. Here’s how Richard Dawkins puts it: “I have used the 
metaphor of a detective, coming on the scene of the crime after it is all 
over and reconstructing from the surviving clues what must have 
happened.”! 

Perhaps the most obvious surviving traces of ancient life are fossils. 
But as evolutionary biologists and paleontologists have come to realize 
that the Precambrian fossil record has not furnished the confirmation 
that Darwin hoped it would, many have looked to other kinds of clues to 
establish the gradual emergence of Cambrian animal life from a 
common ancestor. 

In this effort, contemporary evolutionary biologists have followed 
the example of Darwin himself. Though Darwin argued that a general 


progression of simpler to more complex forms of life in the fossil record 
meshed nicely with his theory, he was acutely aware that the 
discontinuity of the fossil record, particularly as evidenced in 
Precambrian and Cambrian strata, did not. This is why he emphasized 
other types of evidence to establish his theory of universal common 
descent. 

In a famous chapter in On the Origin of Species titled “The Mutual 
Affinities of Organic Beings,” Darwin made his case not on the basis of 
the fossil evidence, but on the basis of similar anatomical structures in 
many distinct organisms. He noted, for example, that the forelimbs of 
frogs, horses, bats, humans, and many other vertebrates exhibited a 
common five-digit (“pentadactyl’) structure or organization (see 
Fig. 5.1). To explain such “homologies,” as he called them, Darwin 
posited a vertebrate ancestor that possessed pentadactyl limbs in 
rudimentary form. As a menagerie of modern vertebrates evolved from 
this common ancestor, each retained in its own way the basic 
pentadactyl mode of organization. For Darwin, his theory of descent 
with modification from a common ancestor explained these similarities 
better than the received view of many older nineteenth-century 
biologists such as Louis Agassiz or Richard Owen, both of whom 
thought homologies reflected the common design plan of a creative 
intelligence. 
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FIGURE 5.1 
The common five-digit pattern of the pentadactyl limb as manifested in 
four modern animals. Copyright Jody F. Sjogren. Used with permission. 


In reconstructing the evolutionary history of life, most evolutionary 
biologists today emphasize the importance of homology. They assume 
that similarities in anatomy and in the sequences of information-bearing 
biomacromolecules such as DNA, RNA, and protein point strongly to a 
common ancestor.” They also assume that the degree of difference in 
such cases is on average proportional to the time elapsed since the 
divergence from a common ancestor. The greater the difference in the 
common feature or molecular sequence, the farther back the ancestor 
from which the feature or sequence arose. 

Evolutionary biologists have used this approach to try to discern the 
evolutionary history of the Cambrian animals. If the Precambrian fossil 
record refuses to disclose the secrets of Precambrian evolution, so goes 
the thinking, perhaps the study of comparative anatomy and molecular 
homologies will. Given the well-established problem with the fossil 


evidence, many evolutionary biologists now particularly emphasize the 
importance of clues from molecular genetics. As evolutionary biologist 
Jerry Coyne, of the University of Chicago, notes, “Now we have a 
powerful, new, and independent way to establish ancestry: we can look 
directly at the genes themselves. By sequencing the DNA of various 
species and measuring how similar these sequences are, we can 
reconstruct their evolutionary relationships.”* 

There are two aspects of this endeavor. First, by analyzing the genes 
of existing animals representing phyla that first arose in the Cambrian, 
scientists have attempted to establish when the common ancestor of the 
Cambrian animal forms lived. This effort has generated what is known 
as the “deep-divergence hypothesis,” which holds that the common 
ancestor of all animal life arose long before the Cambrian explosion. 
Second, by analyzing anatomical and molecular similarities, biologists 
have attempted to reconstruct the Precambrian—Cambrian tree of life, 
mapping the course of evolution during a cryptic period before the 
Cambrian. 

Defenders of neo-Darwinism assert that these techniques have 
produced a coherent evolutionary picture of the early history of animal 
life. They assert that clues from the realm of genetics point 
unequivocally to Precambrian ancestral forms and to an evolutionary 
history that fossils have failed to document. 

This chapter will examine what genes tell us about the alleged 
universal common ancestor of all animals; the next chapter will consider 
whether the analysis of genes (and other features of organisms) yield a 
coherent treelike picture of the Precambrian prehistory of animal life. 
Genetic analyses have indeed revealed a trove of clues. The question is: 
Do those genetic clues establish the Precambrian ancestor and history 
that fossils have failed to document, or, as sometimes occurs in criminal 
investigations, has there been a rush to judgment? 


DEEP DIVERGENCE 


Many paleontologists and evolutionary biologists now concede that the 
long-sought-after Precambrian fossils, those necessary to document a 
Darwinian account of the origin of animal life, are missing.* Scientists 
are especially candid about this when addressing each other in the 
technical peer-reviewed literature. Often, however, defenders of 
evolutionary orthodoxy raise another possibility—that the common 
ancestor of the Cambrian animals has been documented after all, not by 
fossil evidence, but by molecular or genetic evidence of what they call a 
“deep divergence” of animal life. In making such claims, these 
biologists clearly privilege molecular evidence over evidence from the 
fossil record. 

Proponents of deep divergence don’t deny that the fossil evidence 
has come up short. Instead, they adopt one of the versions of the artifact 
hypothesis to account for that missing evidence. They then argue that 
there was no “explosion” of animal forms in the Cambrian, but rather a 
“long fuse” of animal evolution and diversification lasting many 
millions of years leading up to what only looks like an “explosion” of 
animal life in the Cambrian, but this evolutionary history was hidden 
from the fossil record. Indeed, they argue that molecular evidence 
establishes a long period of undetected or cryptic evolution in 
Precambrian times, beginning from a common ancestor some 600 
million to 1.2 billion years ago, depending upon which study of the 
molecular genetic data they cite. If correct, the Cambrian phyla may 
have had many hundreds of millions of years to evolve from a common 
ancestor.” 


THE MOLECULAR CLOCK 


Advocates of deep divergence use a method of analysis known as the 
“molecular clock.” Molecular clock studies also assume that the extent 
to which sequences differ in similar genes in two or more animals 
reflects the amount of time that has passed since those animals began to 
evolve from a common ancestor. A small difference means a short time; 
a big difference, a long time. To determine exactly how short or long, 


these studies estimate the mutation rate by analyzing genes in two 
species or taxa that are thought to have evolved from an ancestor whose 
presence in the fossil record can be discerned and dated accurately. For 
example, many molecular-clock studies of birds and mammals are 
calibrated based on the age of an early reptile thought to be the most 
recent common ancestor of both. 
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FIGURE 5.2 

The idea behind the molecular clock. The two animals and their 
homologous gene sequences at the right of the figure show the 
molecular distance between two present-day animals, that is, how many 
mutational differences have accumulated over time since they diverged 
on the tree of life. The animal at the left of the figure (a mammal-like 
reptile) represents the common ancestor from which these animals 
presumably evolved. Knowing how long ago the common ancestor (the 
mammal-like reptile) lived, and how many mutational differences have 
accumulated in its descendants during that time, allows scientists to 
calculate a mutation rate. In theory, once the mutation rate has been 
determined, it can be used to calculate the divergence time of other 
present-day species, after their homologous genes have been compared 
for differences. 


Genetic comparisons enable evolutionary biologists to estimate the 
number of mutational changes since divergence, and dating of the strata 
containing presumed fossil ancestors tells how long ago the divergence 
occurred. Assuming that different lineages evolve at the same rate,° 
together the two pieces of information enable evolutionary biologists to 
calculate a baseline mutation rate. They can then use that rate to 
determine how long ago some other pair of animals diverged from each 
other on the evolutionary tree (see Fig. 5.2).’ 

Advocates of the deep-divergence hypothesis have applied this 
method to analyzing similar genes, RNA molecules, or proteins in pairs 
of animals belonging to phyla that first arose in the Cambrian period. In 
this way they estimate how long it took for the different animal phyla to 
diverge from a common Precambrian ancestor. 


DEEP AND DEEPER: EVIDENCE FOR DEEP DIVERGENCE 


In the 1990s, evolutionary biologists Gregory A. Wray, Jeffrey S. 
Levinton, and Leo H. Shapiro performed a major study of Cambrian- 
relevant molecular sequence data. In 1996, they published their results 
in a paper entitled “Molecular Evidence for Deep Precambrian 
Divergences Among Metazoan Phyla.”* Wray’s team compared the 
degree of difference between the amino-acid sequences of seven 
proteins’ derived from several different modern animals representing 
five Cambrian phyla (annelids, arthropods, mollusks, chordates, and 
echinoderms). They also compared the nucleotide base sequences of a 
ribosomal RNA molecule!” from the same animal representatives of the 
same five phyla. 

The Wray study concluded that the common ancestor of the animal 
forms lived 1.2 billion years ago, implying that the Cambrian animals 
took some 700 million years to evolve from this “deep-divergence” 
point before first appearing in the fossil record. Wray and his colleagues 
attempted to explain the absence of fossil ancestral forms during this 
period of time by postulating that Precambrian ancestors existed in 
exclusively soft-bodied forms, rendering their preservation unlikely. 


More recently, Douglas Erwin and several colleagues performed a 
study comparing the degree of sequence difference between other genes 
—seven nuclear housekeeping genes!! and three ribosomal RNA 
genes!? across 113 different species of living Metazoa. (The term 
“Metazoa” refers to animals with differentiated tissue. The term 
“metazoan” refers to one such animal or can be used as an adjective, as 
in “the metazoan phyla.”) They estimated that “the last common 
ancestor of all living animals arose nearly 800 million years ago.”!° 

Many similar studies affirm a very ancient or “stratigraphically 
deep” divergence of the animal forms, in opposition to those who claim 
that the Cambrian animals appeared suddenly within just a few million 
years.'* Each of these studies affirms the gradual emergence of animal 
life that most researchers expected to find on the basis of a Darwinian 
picture of the history of animal life. Indeed, a major aim of the Wray 
study was to challenge the view “that the animal phyla diverged in an 
‘explosion’ near the beginning of the Cambrian period.”!> Wray and his 
colleagues argue instead that “all mean divergence time estimates 
between these four phyla and chordates, based on all seven genes, 
substantially predate the beginning of the Cambrian period.”!° They 
conclude: “Our results cast doubt on the prevailing notion that the 
animal phyla diverged explosively during the Cambrian or late Vendian, 
and instead suggest that there was an extended period of divergence 
during the mid-Proterozoic, commencing about a billion years ago.”!” 

From an orthodox Darwinian point of view, the conclusions of these 
studies seem almost unavoidable since (1) the neo-Darwinian 
mechanism requires vast amounts of time to produce anatomical novelty 
and (2) such phylogenetic analyses assume that all the animal forms 
descended from a common ancestor. Many evolutionary biologists 
claim that clues long hidden in DNA now confirm these Darwinian 
axioms and, consequently, the existence of an extremely ancient, 
Precambrian ancestor of the Cambrian animals. As Andrew Knoll, a 
Harvard paleontologist, states, “The idea that animals should have 


originated much earlier than we see them in the fossil record is almost 
inescapable.” !® 


REASONABLE DOUBT 


Nevertheless, there is now good reason to doubt this allegedly 
overwhelming genetic evidence. In the idiom of our forensic metaphor, 
other material witnesses (fossils) have already come forward to testify, 
the testimony of the genes (and other key indicators of biological 
history) is grossly inconsistent, and that genetic testimony has come to 
us through a translator, who is shaping the way the jury perceives the 
evidence. Let’s look at each of these problems in turn. 


Fossil Testimony 


Recall that the deep-divergence hypothesis has two components. One of 
them—a version of the artifact hypothesis—provides an explanation for 
why the Precambrian ancestral fossils are missing. And here the deep- 
divergence hypothesis first runs into trouble. As we saw in Chapter 3, 
there is no currently plausible version of the artifact hypothesis. The 
preservation of numerous soft-bodied Cambrian animals as well as 
Precambrian embryos and microorganisms undermines the idea of an 
extensive period of undetected soft-bodied evolution. In addition, the 
claim that exclusively soft-bodied ancestors preceded the hard-bodied 
Cambrian forms remains anatomically implausible. A brachiopod 
cannot survive without its shell. An arthropod cannot exist without its 
exoskeleton. Any plausible ancestor to such organisms would have 
likely left some hard body parts, yet none have been found in the 
Precambrian. Yet the deep-divergence hypothesis, whatever its other 
merits, requires a viable artifact hypothesis to explain the absence of 
fossilized Precambrian ancestors. 


The Testimony o fGenes: Con ficting Stories 


There is a second, more telling reason to doubt the deep-divergence 
hypothesis: the results of different molecular studies have generated 
widely divergent results. Yet presumably there was only one common 
ancestor of all the Metazoa and only one ultimate divergence point. 

For example, comparing the Wray-led study and the Erwin-led study 
generates a difference of 400 million years. In the case of other studies, 
even greater differences emerge. Many other studies have thrown their 
own widely varying numbers into the ring, placing the common 
ancestor of animals anywhere between 100 million and 1.5 billion years 
before the Cambrian explosion (some molecular clock studies, oddly, 
even place the common ancestor of the animals after the Cambrian 
explosion).!? As Douglas Erwin, writing with fellow paleontologists 
James Valentine and David Jablonski, acknowledged in 1999: 
“Attempts to date those branching[s]” from a common Precambrian 
ancestor “by using molecular clocks have disagreed widely.””° How can 
this be? 

In the first place, different studies of different molecules generate 
widely divergent dates. In addition to the studies I have already cited, a 
1997 paper by Japanese biologist Naruo Nikoh and colleagues 
examined two genes (aldolase and triose phosphate isomerase), and 
dated the split between eumetazoa and parazoa—animals with tissues 
(like cnidarians) from those without (like sponges)—at 940 million 
years ago.”! Compare that to a 1999 paper by Daniel Wang, Sudhir 
Kumar, and S. Blair Hedges based on the study of 50 different genes, 
showing that “the basal animal phyla (Porifera, Cnidaria, Ctenophora) 
diverged between about 1200-1500 Ma.” 

Sometimes contradictory divergence times are reported in the same 
article. For instance, a refreshingly forthright paper by evolutionary 
biologist Lindell Bromham of Australian National University and 
colleagues in Proceedings of the National Academy of Sciences USA 
analyzed two different molecules, mitochondrial DNA and 18S rRNA, 
to yield individual gene-based divergence dates that differed by as much 
as 1 billion years.*? Another study investigating the divergence between 
arthropods and vertebrates found that depending on which gene was 


used, the divergence date might be anywhere between 274 million and 
1.6 billion years ago, the former date falling almost 250 million years 
after the Cambrian explosion.** That paper in its conclusion chose to 
split the difference, confidently reporting an arithmetic average of about 
830 million years ago. Likewise, bioinformaticians Stéphane Aris- 
Brosou, now at the University of Ottawa, and Ziheng Yang, at 
University College London, found that depending on which genes and 
which estimation methods were employed, the last common ancestor of 
protostomes or deuterostomes (two broadly different types of Cambrian 
animals) might have lived anywhere between 452 million and 2 billion 
years ago.” 

A survey of recent deep-divergence studies, by molecular 
evolutionists Dan Graur and William Martin, notes one study in which 
the authors claim to be 95 percent certain that their divergence date for 
certain animal groups falls within a 14.2-billion-year range—more than 
three times the age of the earth and clearly a meaningless result.*° Graur 
and Martin conclude that many molecular-clock estimates “look 
deceptively precise,” but, given the nature of this field, their “advice to 
the reader is: whenever you see a time estimate in the evolutionary 
literature, demand uncertainty! ”?” The title of their paper, published in 
Trends in Genetics, made the point still more vividly: “Reading the 
Entrails of Chickens: Molecular Timescales of Evolution and the 
Illusion of Precision.” 

Sometimes even different studies of the same or similar groups of 
molecules have generated dramatically different divergence times. For 
example, Francisco Ayala and several colleagues have recalculated the 
divergence times of Metazoan phyla, using mostly the same protein- 
coding genes as Wray’s team.*® Correcting for “a host of statistical 
problems””? in the Wray study, Ayala and colleagues found that their 
own estimates “are consistent with paleontological estimates’—not with 
the deep-divergence hypothesis. “Extrapolating to distant times from 
molecular evolutionary rates estimated within confined data-sets,” they 
conclude, is “fraught with danger.”?? Or as Valentine, Jablonski, and 


Erwin conclude, “The accuracy of the molecular clock is still 
problematical, at least for phylum divergences, for the estimates vary by 
some 800 million years depending upon the techniques or molecules 
used.”3! Reported Precambrian divergence times would vary even more 
dramatically were it not that evolutionary biologists and molecular 
taxonomists ignore certain molecules in their studies to avoid grossly 
contradictory results. Consider, for example, histones—proteins found 
in all eukaryotes involved in packing DNA into chromosomes. Histones 
exhibit little variation from one species to the next.** They are never 
used as molecular clocks. Why? Because the sequence differences 
between histones, assuming a mutation rate comparable to that of other 
proteins, would generate a divergence time at significant variance with 
those in studies of many other proteins.” Specifically, the small 
differences between histones yield an extremely recent divergence, 
contrary to other studies. Evolutionary biologists typically exclude 
histones from consideration, because those times do not confirm 
preconceived ideas about what the Precambrian tree of life ought to 
look like. 

But that raises obvious questions. If we don’t have fossils 
documenting a common animal ancestor, and if genetic studies produce 
such different and contradictory divergence times, how do we know 
what the tree of life should look like and when the first animals began to 
diverge from a common ancestor? If histones change too slowly to 
provide an accurate calibration of the molecular clock, then which 
molecules do change at the correct rate—and how do we know that they 
do? The answer to these questions for most evolutionary biologists 
usually runs something like this. We already know that the animal phyla 
evolved from a common ancestor and we also know roughly when they 
did; therefore, we must reject studies based on histone sequences 
because the conclusions of these studies would contradict that date. 

But do we really know these things, and if so how? Assumptions 
about the window of time in which the first metazoan, the ancestor of all 
animals, must have lived are clearly not derived from the testimony of 
molecular genetics alone, since the results of sequence comparisons 


vary so greatly and include dates, depending upon the molecule studied, 
that fall outside that window. Instead, as one widely used textbook 
euphemistically puts it, evolutionary biologists must choose 
“phylogenetically informative” data.’ By this, they mean sequences 
that exhibit neither too little nor too much variation—where too much 
and too little are determined by preconceived considerations of 
evolutionary plausibility, rather than by reference to independent criteria 
for determining the accuracy of molecular methods. 

The subjective quality of these conclusions, where scientists “cherry- 
pick” evidence that conforms to favored notions and discard the rest, 
casts further doubt on the extent to which molecular comparisons yield 
any clear historical signal. Only one divergence point could represent 
the actual universal common ancestor of all animals. If, however, 
comparative sequence analyses generate divergence times that are 
consistent with nearly all possible evolutionary histories, with the 
divergence event ranging from a few million to a few billion years ago, 
then clearly most of these possible histories must be wrong. They tell us 
little about the actual time of the Precambrian divergence, if such an 
event really happened. 


Questionable Assumptions 


Other problems run even deeper, having to do with the assumptions that 
make comparative sequence analyses possible in the first place. These 
comparisons assume the accuracy of molecular clocks—that mutation 
rates of organisms have remained relatively constant throughout 
geological time. These studies also assume, rather than demonstrate, the 
theory of universal common descent. Both assumptions are problematic. 

Even if we assume that mutation and natural selection (and other 
similarly undirected evolutionary processes) can account for the 
emergence of novel proteins and body plans, we cannot also assume that 
the protein molecular clock ticks at a constant rate. Unlike radiometric 
dating methods, molecular clocks depend on a host of contingent 
factors. As Valentine, Jablonski, and Erwin note, “Different genes in 


different clades evolve at different rates, different parts of genes evolve 
at different rates and, most importantly, rates within clades have 
changed over time.”*° So great is this variation that one paper in the 
journal Molecular Biology and Evolution cautions, “The rate of 
molecular evolution can vary considerably among different organisms, 
challenging the concept of the ‘molecular clock.’ ”*° 

Keep in mind too that molecular clocks are calibrated based on the 
estimated age of presumably ancestral fossils. If, however, such 
estimates are incorrect by even a few million years, or if the fossil used 
to calibrate the mutation rate does not lie at the actual divergence point 
on the tree of life, the estimated mutation rate may be badly skewed. 
Calibration of molecular clocks depends on an accurate understanding 
of the ancestor-descendant relationships between fossils and their 
presumed descendant taxa. If the fossil used to calibrate the divergence 
time of two later groups was not actually a true ancestor, then the 
mutation-rate calculation based on that fossil’s age may be grossly 
inaccurate. As Andrew Smith and Kevin Peterson note: “Molecular 
clocks are not error-free and come with their own suite of problems. .. . 
The accuracy of the technique depends upon having an accurate 
calibration point or points, and a reliable phylogeny with correct 
branching order and branch-length estimates.”*’ Because these 
conditions are rarely met, “the idea that there is a universal molecular 
clock ticking away has long since been discredited.”*® 

Applying the molecular clock to dating the alleged Precambrian 
ancestor of the animals further complicates matters. Because there are 
so few fossils in the Precambrian and no clear ancestor-descendant 
lineages, the calibration of the molecular clock must be done on the 
basis of very different fossil lineages arising hundreds of millions of 
years later. Indeed, without evidence from the fossil record (older than 
550 million years ago) with which to calibrate the molecular clock, any 
attempt to date the origin of the Cambrian animal phyla becomes highly 
questionable.*? Perhaps for this reason, Valentine, Jablonski, and Erwin 
have wondered whether “molecular clock dates can ever be applied 
reliably to such geologically remote events as Neoproterozoic 


branchings within the Metazoa.’*? (The Neoproterozic is the last 
Precambrian era.) These methodological problems may help account for 
the cacophony of conflicting results. 


SMUGGLING IN DARWIN 


A second crucial assumption behind the deep-divergence hypothesis is 
the idea of the common descent of all the animal forms—1.e., that all the 
Cambrian animals evolved from a common Precambrian ancestor. As 
the textbook Understanding Bioinformatics admits, “The key 
assumption made when constructing a phylogenetic tree from a set of 
sequences is that they are all derived from a single ancestral sequence, 
i.e., they are homologous.”! Or as the Harvard University Press 
textbook The Tree of Life states, “We are obliged to assume at first that, 
for each character, similar states are homologous,” whereby 
“homologous” the text means characters are similar because they share 
a common ancestor.” 

This assumption (of universal common descent) raises the possibility 
that the ancestral entities represented by divergence points in these 
studies are artifacts of the assumptions by which molecular data are 
analyzed. Indeed, the computer programs that are used to compare 
molecular sequences have been written to produce trees showing 
common ancestors and branching relationships regardless of the extent 
to which the genes analyzed may or may not differ. Phylogenetic studies 
compare two or more gene sequences and then use degrees of difference 
to determine divergence points and nodes on a phylogenetic tree. 
Inherent in that procedure is the assumption that the nodes and 
divergence points existed in the past. 

Thus, the deep-divergence studies do not, in any rigorous sense, 
establish any Precambrian ancestral forms. Did a single, original 
metazoan or bilaterian ancestor of the Cambrian animals actually exist? 
The Precambrian—Cambrian fossil record taken on its face certainly 
doesn’t document such an entity. But neither do deep-divergence 
studies. Instead, these studies assume the existence of such ancestors, 


and then merely attempt, given that assumption, to determine how long 
ago such ancestors might have lived. One could argue that the 
conflicting divergence points do at least show that some common 
ancestor existed in the Precambrian, since, despite their conflicting 
results, all divergence studies indicate at least that. But, again, to invoke 
molecular studies that assume the existence of a common ancestor as 
evidence for such an entity only begs the question. Certainly it provides 
no reason for using molecular evidence to trump fossil evidence. 
Perhaps the Precambrian rocks do not record ancestors for the Cambrian 
animals because none existed. To foreclose that possibility, and to 
resolve the mystery of the missing Precambrian ancestral fossils, 
evolutionary biologists cannot use studies that assume the existence of 
the very entity their studies are thought to establish. 


THE “SHMOO”: A CATCH-22 REVISITED 


The concept of deep divergence raises another issue related to my 
discussion at the end of the previous chapter about what would be 
required to document the missing ancestral forms of the Cambrian 
animals. Recall that I argued there that any plausible postulated 
common ancestor to all the animal phyla must have necessarily lacked 
most (or all) of the specific anatomical features that distinguish one 
phylum from another. The more arthropod-like a hypothetical animal 
form, the less plausible it would have been as an ancestor to the 
chordates, mollusks, echinoderms, annelid worms, and vice versa. In 
each case, the design logic and arrangement of parts necessary to 
provide the foundation for one mode of animal life preclude it from 
providing the foundation for other modes of animal life—just as a 
system of parts providing one mode of transportation (with a bicycle, 
for example) will typically preclude functioning as another (as with a 
submarine, for example).** 

For this reason, biologists thinking about the characteristics of the 
earliest ancestor of all the metazoan phyla—the actual animal at the 
deep-divergence point—have typically postulated an extremely simple 


form of life—what one evolutionary biologist described to me as a 
“shmoo,” after the blob-like cartoon character made famous by Al Capp 
in the 1940s and 1950s. Some have proposed that the ur-animal might 
have been something like a placozoan, a modern amorphous animal 
with only four cell types and no bilateral symmetry.*+ Other 
paleontologists have mainly characterized the hypothetical ur-metazoan 
negatively, by reference to the characteristics that it must not have had 
to be a plausible common ancestral form to all other metazoa. (This 
need to characterize the ur-metazoan negatively has led some leading 
paleontologists to question whether the ur-animal can be described 
affirmatively with any specificity.*>) 

In any case, the need to characterize the ur-animal as an extremely 
simple “shmoo-like” form, lacking the numerous characteristics and 
anatomical novelties present in the Cambrian animals, highlights a deep 
dilemma for evolutionary theorists. On the one hand, to be plausible as a 
common ancestor of all the animal phyla, a hypothetical ur-metazoan 
must have few characteristics of later metazoan forms. Indeed, the more 
plausible the hypothetical ancestor, the simpler it must be, meaning it 
will lack more of the specific distinguishing features of the individual 
animal phyla. But that means any evolutionary scenario for the origin of 
the animals that postulates such a “stripped down” animal form as its 
starting point will need to envision those distinguishing characteristics 
arising later. And the fewer the number of characteristics in the 
hypothetical common ancestor, the more such characteristics will need 
to arise later. This logical requirement implies, in turn, the need for an 
even deeper divergence point in Precambrian history and the need for 
more time to produce these specific anatomical novelties—in turn, 
exacerbating the problem of fossil discontinuity. The more plausible the 
hypothetical common ancestor, the deeper the necessary divergence 
point and the greater the morphological discontinuity in the fossil 
record. 

On the other hand, proposing a more complex (and more 
anatomically differentiated) common ancestor closer in its affinities to 
some Cambrian animal forms, would eliminate the need for so deep a 


divergence point. Nevertheless, it would also diminish the plausibility 
of such a hypothetical ancestor as an ur-metazoan common to all the 
other Cambrian animals. Again, the more a hypothetical form resembles 
one of the specific animal forms or phyla, the less plausible it will be as 
an ancestor of all the others. And that is the dilemma. Could there have 
been an animal form simple enough to serve as a viable ancestor 
common to all the animal phyla? Perhaps. But positing such a form only 
deepens the required depth of the divergence point and intensifies the 
already significant problem of Precambrian—Cambrian fossil 
discontinuity. 


DEEP TROUBLE 


Comparative genetic analyses do not establish a single deep-divergence 
point, and thus do not compensate for a lack of fossil evidence for key 
Cambrian ancestors—such as the ur-bilaterian or ur-metazoan ancestor. 
The results of different studies diverge too dramatically to be 
conclusive, or even meaningful; the methods of inferring divergence 
points are fraught with subjectivity; and the whole enterprise depends 
upon a question-begging logic. Many leading Cambrian paleontologists, 
and even some leading evolutionary biologists, now express skepticism 
about both the results and the significance of deep-divergence studies. 
For example, Simon Conway Morris has rejected the idea that such 
studies should trump fossil evidence of a more explosive, shallow and 
rapid Cambrian radiation. After assessing the inconsistent track record 
of deep-divergence studies, he concludes, “A deep history extending to 
an origination in excess of 1,000 Myr [million years] is very unlikely.’*° 
Conway Morris is one of several leading evolutionary biologists and 
Cambrian paleontologists who have expressed skepticism about these 
studies.4’ In any case, there is now little reason to regard the deep- 
divergence hypothesis as a genuine solution to the Cambrian 
conundrum. 


6 
The Animal Tree of Life 


In 2009, in honor of the bicentennial anniversary of Darwin’s birth, a 
piece of artwork was created to adorn the ceiling of an exhibit room at 
the Natural History Museum in London. A paper in the journal Archives 
of Natural History noted that the inspiration for the artwork, titled 
“TREE,” came from a diagram that Darwin had sketched in one of his 
notebooks—a diagram that later came to be known as the “tree of life” 
(see Fig. 2.1la). One BBC radio program called the TREE exhibit the 
“Darwinian Sistine Chapel.”! Another article in Archives of Natural 
History, a journal published by the University of Edinburgh, remarked 
that, “TREE celebrates Darwinian evolutionism” and “secular science 
and reason.” 

For many biologists, the iconic image of Darwin’s tree of life 
represents perhaps the single best distillation of what the science of 
evolutionary biology has to teach, namely the “fact of evolution,” apart 
from which “nothing in biology makes sense.”* Though the fossil record 
does not directly attest to many of the expected intermediate forms 
represented on Darwin’s tree, leading authorities assert that other lines of 
evidence, particularly from genetics, firmly establish Darwin’s tree as 
the correct picture of the history of life. 

In the previous chapter, we saw that there are many good reasons to 
doubt the deep-divergence hypothesis and its claim to have determined, 
based upon genetic evidence, the time at which the Cambrian animals 
began to evolve from specific Precambrian ancestors. Indeed, the idea 
that these studies can pinpoint when an ur-metazoan, or an ur-bilaterian, 


arose has engendered increasing skepticism among a growing number of 
evolutionary biologists and paleontologists. 

The tree of life as a whole, however, is another matter. Many 
evolutionary biologists think the case for universal common descent is 
something close to unassailable because, they argue, analysis of both 
anatomical and genetic similarities converges on the same basic pattern 
of descent from a universal common ancestor. As Richard Dawkins 
asserts, “when we look comparatively at . . . genetic sequences in all 
these different creatures—we find the same kind of hierarchical tree of 
resemblance. We find the same family tree—albeit much more 
thoroughly and convincingly laid out—as we did with . . . the whole 
pattern of anatomical resemblances throughout all the living 
kingdoms.”° Likewise, Jerry Coyne argues that gene sequences 
independently confirm the same set of evolutionary relationships—the 
same basic tree—established from the analysis of anatomy. Oxford 
University chemist Peter Atkins is even more emphatic: “There is not a 
single instance of the molecular traces of change being inconsistent with 
our observations of whole organisms.’’’ 

As a result of this confidence, evolutionary biologists often dismiss 
the missing Precambrian fossil precursors and intermediates as a minor 
anomaly—one awaiting explanation by an otherwise completely 
adequate theory of the history of life. Because most evolutionary 
biologists are confident that a single continuous tree, with a single root, 
best represents the history of life—and explains so many other diverse 
facts of biology—they continue to think the same tree-like pattern also 
accurately describes the Cambrian explosion and the Precambrian 
history of animal life. Moreover, when evolutionary biologists 
reconstruct the phylogenetic history of a group (including animals), they 
typically do so in a time-independent manner. Their concern is usually to 
establish a relative order of branching along the tree of life, not to 
establish or “pinpoint” a series of absolute dates at which divergences 
occurred. Thus, although deep-divergence studies do not establish the 
existence of Precambrian animal ancestors for all the reasons argued in 
the previous chapter, the uncertainty surrounding the dates in these 


studies has not, for most evolutionary biologists, undermined their 
confidence in the overall tree-like pattern of animal life. Instead, many 
evolutionary biologists believe the strength of the case for the tree of life 
as a whole, based on other phylogenetic studies of similar genes and 
anatomical traits, indirectly establishes the existence of the missing 
evolutionary precursors of the Cambrian animals. As Coyne explains, “It 
stands to reason that if the history of life forms a tree, with all species 
originating from a single trunk, then one can find a common origin for 
every pair of twigs (existing species) by tracing each twig back through 
its branches until they intersect at the branch they have in common. This 
node, as we’ve seen, is their common ancestor.’””® 

On the basis of similar logic, evolutionary biologists have typically 
assumed that what they think is true of all other forms of life is true of 
the Cambrian forms—that there must be a universal animal tree, the 
absence of fossil evidence, and the conflicting results of deep-divergence 
studies, notwithstanding. 

To assess the other evidence from genetics that supports this 
conclusion, it’s useful to review how the case for the universal animal 
tree is similar to the deep-divergence hypothesis and also how it is 
different. To establish both the fact and shape of the Darwinian animal 
tree of life, evolutionary biologists have long used methods that assume 
that both molecular sequences and anatomical similarities provide an 
accurate historical signal about the past. Like deep-divergence studies, 
these methods of “phylogenetic” reconstruction assume that the species 
or larger groupings (taxa) are related by descent from a common 
ancestor. (The term phylogeny, again, refers to the evolutionary history 
of a group of organisms. Thus, a “phylogenetic reconstruction” is an 
attempt to determine that history.) Such studies assume that the degree 
of difference between molecular or anatomical features in pairs of 
organisms indicates how long ago they diverged from a common 
ancestor. They also use independent calibrations of the molecular clock 
to calculate the exact divergence times.” 

Unlike deep-divergence studies, however, which attempt to establish 
just a single divergence time—such as that of the common ancestor of 


all the animal phyla—these more detailed phylogenetic studies seek to 
establish the contours of the Precambrian tree of animal life. This 
involves assessing degrees of relatedness among representatives of all 
the Cambrian phyla to establish multiple divergence points and times 
(nodes on the tree of life) as well as the relationships of the major 
Cambrian groups. 

Investigators employ these methods even in the absence of 
corroborating fossil evidence. In his textbook on fossils and evolution, 
following a full-page depiction of the discontinuous appearance of the 
Cambrian animals in the fossil record, Occidental College geologist 
Donald Prothero explains, “If the fossil record is poor in one particular 
group, we look to other sources of data.” He concludes that two such 
sources of data, anatomical and molecular data, now “converge on a 
common answer’—one “that is almost certainly ‘the truth’ (as much as 
we can use that term in science).”!° 

But is all of this true? Does analysis of the genetic and anatomical 
similarity of the Cambrian animals really establish that the history of 
animal life is best depicted as a continuously branching tree? Does the 
pattern of a branching tree accurately depict the history of Precambrian 
and Cambrian animal life, and in so doing establish the existence of 
Precambrian forms that the fossil record fails to document? 





THE PRECAMBRIAN AND CAMBRIAN TREE OF ANIMAL 
LIFE 


History happened once. And if Richard Dawkins is correct that “there is, 
after all, one true tree of life, the unique pattern of evolutionary 
branchings that actually happened,”!! then evolutionary history also 
happened once. Consequently, if we think of evolutionary trees 
describing the relationships of animal groups as hypotheses about an 
unobserved history (which is what they are), then having two or more 
conflicting hypotheses about only one history—the history that actually 
happened—means that we haven’t figured out what did happen. A 
widely used textbook on phylogenetic methods explains this: “The fact 


that there is only one true tree . . . provides the basis for testing 
alternative hypotheses. If two hypotheses are generated for the same 
group of species, then we can conclude that at least one of these 
hypotheses is false. Of course, it is possible that both are false and some 
other tree is true.”!? 

When a body of evidence supports multiple conflicting historical 
hypotheses, the evidence cannot be sending a definitive historical signal 
about what happened in the past. That raises the possibility that it may 
not be sending a signal at all. Conversely, when the evidence leads 
investigators to converge around a single historical hypothesis, when 
one hypothesis best explains a whole group of clues, it is much more 
likely that the evidence is telling us what actually happened. 

Consider, by way of illustration, a case in which we know a true 
history of ancestor—descendant relationships to see how evidence can 
converge around a single (unequivocal) history. Between 1839 and 1856, 
Charles Darwin and his wife, Emma, produced ten children, listed below 
in alphabetical order: 


Anne 
Charles 
Elizabeth 
Francis 
George 
Henrietta 
Horace 
Leonard 
Mary 
William 


This alphabetical listing, of course, is not their actual birth order. 
Instead, it is one of a large number of possible birth orders for Darwin’s 
children, only one of which is the correct sequence. Indeed, only one of 
these arrangements can represent the actual Darwin family history. 


Now, suppose I gave you and some friends a pile of historical 
evidence about Darwin’s children, and asked you to “solve for their birth 
order.” No one would consider the problem solved if you came back 
with more than one order. On the other hand, if you came back and 
presented a single coherent hypothesis of the birth order supported by 
evidence from birth records, family letters, and photographs from the 
Darwin family archives, that would provide persuasive evidence that 
you had obtained the correct solution. Since only one true history exists, 
once you find it the evidence will tend to fall naturally into place. 

But does the evidence for a Precambrian animal tree of life fall 
similarly into place or does it generate multiple conflicting histories? 
We’ve already seen that fossil evidence does not point to a specific 
Precambrian tree of animal life, or perhaps to any tree at all. We’ve also 
seen that genetic evidence by itself does not establish a single 
divergence point for animal evolution. But what about the genetic and 
anatomical evidence taken together? Does that evidence converge on a 
single history of animal life? If so, then it could well make up for a lack 
of fossil evidence. Otherwise, it would seem to raise an obvious 
question: Are the observed genetic and anatomical “affinities” among 
the Cambrian phyla sending reliable historical signals at all? 


CONFLICTING HISTORIES 


There are several reasons to doubt that evidence of genetic and 
anatomical similarity is sending a reliable signal of the early history of 
animal life. First, comparisons of different molecules frequently generate 
divergent trees. Second, comparisons of anatomical characteristics and 
molecules frequently produce divergent trees. Third, trees based solely 
on different anatomical characteristics often contradict each other. Let’s 
examine each problem. 


Molecules vs. Molecules 


Just as the molecular data do not point unequivocally to a single date for 
the last common ancestor of all the Cambrian animals (the point of deep 


divergence), they do not point unequivocally to a single coherent tree 
depicting the evolution of animals in the Precambrian. Numerous papers 
have noted the prevalence of contradictory trees based on evidence from 
molecular genetics. A 2009 paper in Trends in Ecology and Evolution 
notes that “evolutionary trees from different genes often have conflicting 
branching patterns.”!? Likewise, a 2012 paper in Biological Reviews 
acknowledges that “phylogenetic conflict is common, and frequently the 
norm rather than the exception.” 14 Echoing these views, a January 2009 
cover story and review article in New Scientist observed that today the 
tree-of-life project “lies in tatters, torn to pieces by an onslaught of 
negative evidence.” As the article explains, “Many biologists now argue 
that the tree concept is obsolete and needs to be discarded,” because the 
evidence suggests that “the evolution of animals and plants isn’t exactly 
tree-like.” 

The New Scientist article cited a study by Michael Syvanen, a 
biologist at the University of California at Davis, who studied the 
relationships among several phyla that first arose in the Cambrian.!> 
Syvanen’s study compared two thousand genes in six animals spanning 
phyla as diverse as chordates, echinoderms, arthropods, and nematodes. 
His analysis yielded no consistent tree-like pattern. As the New Scientist 
reported, “In theory, he should have been able to use the gene sequences 
to construct an evolutionary tree showing the relationships between the 
six animals. He failed. The problem was that different genes told 
contradictory evolutionary stories.” Syvanen himself summarized the 
results in the bluntest of terms: “We’ve just annihilated the tree of life. 
It’s not a tree anymore, it’s a different topology [pattern of history] 
entirely. What would Darwin have made of that?”’!® 

Other studies trying to clarify the evolutionary history and 
phylogenetic relationships of the animal phyla have encountered similar 
difficulties. Vanderbilt University molecular systematist Antonis Rokas 
is a leader among biologists using molecular data to study animal 
phylogenetic relationships. Nevertheless, he concedes that a century and 
a half after The Origin of Species, “a complete and accurate tree of life 
remains an elusive goal.”!’ In 2005, during the course of an authoritative 


study he eventually copublished in Science, Rokas was confronted with 
this stark reality. The study had sought to determine the evolutionary 
history of the animal phyla by analyzing fifty genes across seventeen 
taxa. He hoped that a single dominant phylogenetic tree would emerge. 
Rokas and his team reported that “a 50-gene data matrix does not 
resolve relationships among most metazoan phyla” because it generated 
numerous conflicting phylogenies and historical signals. Their 
conclusion was candid: “Despite the amount of data and breadth of taxa 
analyzed, relationships among most metazoan phyla remained 
unresolved.”!® 

In a paper published the following year, Rokas and University of 
Wisconsin at Madison biologist Sean B. Carroll went so far as to assert 
that “certain critical parts of the TOL [tree of life] may be difficult to 
resolve, regardless of the quantity of conventional data available.”!° 
This problem applies specifically to the relationships of the animal 
phyla, where ‘{m]any recent studies have reported support for many 
alternative conflicting phylogenies.” Investigators studying the animal 
tree found that “a large fraction of single genes produce phylogenies of 
poor quality” such that in one case, a study “omitted 35% of single 
genes from their data matrix, because those genes produced phylogenies 
at odds with conventional wisdom.”?! Rokas and Carroll tried to explain 
the many contradictory trees by proposing that the animal phyla might 
have evolved too quickly for the genes to record some signal of 
phylogenetic relationships into the respective genomes. In their view, if 
the evolutionary process responsible for anatomical novelty works 
quickly enough, there would not be sufficient time for differences to 
accumulate in key molecular markers, in particular those used to infer 
evolutionary relationships in different animal phyla. Then, given enough 
time, whatever signal did exist might become lost. Thus, when groups of 
organisms branch rapidly and then evolve separately for long periods of 
time, this “can overwhelm the true historical signal”*?—leading to the 
inability to determine evolutionary relationships. 

Their article brings the discussion of the Cambrian explosion full 
circle from an attempt to use genes to compensate for the absence of 


fossil evidence to the acknowledgment that genes do not convey any 
clear signal about the evolutionary relationships of the phyla first 
preserved by fossils in the Cambrian. The logic of their analysis also 
leads them to a strangely familiar conclusion. Since the analysis of key 
genetic markers—like the genes tracked in molecular-clock studies that 
presumably accumulate mutations at a constant rate—shows a low 
number of mutational differences between the Cambrian animal phyla, 
Rokas and Carroll conclude from specifically genetic evidence that the 
phyla must have diverged rapidly. As they put it in another paper, 
“Inferences from these two independent lines of evidence (molecules 
and fossils) support a view of the origin of Metazoa as a radiation 
compressed in time.” Thus, the inability to reconstruct the 
evolutionary history of the animal phyla from the molecular data not 
only fails to establish a Precambrian pattern of descent; it ironically also 
reaffirms the extreme rapidity of the origin of the Cambrian animal 
forms. 


Molecules vs. Anatomy 


In 1965, chemist Linus Pauling and biologist Emile Zuckerkandl, often 
hailed as the fathers of the molecular-clock concept, proposed a rigorous 
way to confirm evolutionary phylogenies. They suggested that if studies 
of comparative anatomy and DNA sequences generated similar 
phylogenetic trees, then “the best available single proof of the reality of 
macroevolution would be furnished.”** As they went on to explain, 
“only the theory of evolution . . . could reasonably account for such a 
congruence between lines of evidence obtained independently.”*° By 
focusing attention on these two independent lines of evidence and the 
possibility of their convergence (or conflict), Pauling and Zuckerkandl 
provided a clear and measurable way to test the neo-Darwinian thesis of 
universal common ancestry. 

And according to some scientists, studies of molecular homologies 
have confirmed expectations about the history of the animal phyla 
derived from studies of comparative anatomy. After citing Pauling and 


Zuckerkandl’s test, Douglas Theobald claims in his “29+ Evidences for 
Macroevolution” that “well-determined phylogenetic trees inferred from 
the independent evidence of morphology and molecular sequences 
match with an extremely high degree of statistical significance.”*° 

In reality, however, the technical literature tells a different story. 
Studies of molecular homologies often fail to confirm evolutionary trees 
depicting the history of the animal phyla derived from studies of 
comparative anatomy. Instead, during the 1990s, early into the 
revolution in molecular genetics, many studies began to show that 
phylogenetic trees derived from anatomy and those derived from 
molecules often contradicted each other. 

Probably the most protracted conflict of this type concerns a widely 
accepted phylogeny for the bilaterian animals. This classification 
scheme was originally the work of the influential American zoologist 
Libbie Hyman.’ Hyman’s view, generally known as the “Coelomata” 
hypothesis, was based on her analysis of anatomical characteristics, 
mainly germ (or primary tissue) layers, planes of body symmetry, and 
especially the presence or absence of a central body cavity called the 
“coelom,” which gives the hypothesis its name. In the Coelomata 
hypothesis, the bilaterian animals were classified in three groups, the 
Acoelomata, the Pseudocoelomata, and the Coelomata, each 
encompassing several different bilaterian animal phyla.2® (See Fig. 
6.1a.) 

Then, in the mid-1990s, a very different arrangement of these animal 
groups was proposed based on the analysis of a molecule present in each 
(the 18S ribosomal RNA; see Fig. 6.1b). The team of researchers who 
proposed this arrangement published a groundbreaking paper in Nature 
with a title that surprised many morphologists: “Evidence for a Clade of 
Nematodes, Arthropods and Other Moulting Animals.”?? The paper 
noted the conventional wisdom, based on Hyman’s hypothesis, that 
arthropods and annelids were closely related because both phyla had 
segmented body plans.*° But their study of the 18S ribosomal RNA 
suggested a different grouping, one that placed arthropods close to 
nematodes within a group of animals that molt, which they called 


“Ecdysozoa.” This relationship surprised anatomists, since arthropods 
and nematodes don’t exactly look like kissing cousins. Arthropods (such 
as trilobites and insects) have coeloms, whereas nematodes (such as the 
tiny worm Caenorhabditis elegans) do not, leading many evolutionary 
biologists to believe nematodes were early branching animals only 
distantly related to arthropods.*! The Nature paper explained how 
unexpected this grouping of arthropods and nematodes was: 
“Considering the greatly differing morphologies, embryological features, 
and life histories of the molting animals, it was initially surprising that 
the ribosomal RNA tree should group them together.” 
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How scientists reconstruct evolutionary history depends on which 
similarities they regard as revealing the true history of descent 
(homology) and which similarities they regard as misleading 
(homoplasy). Advocates of the Coelomata hypothesis (Figure 6.la) 
regard the coelom (body cavity) as a homologous feature. Thus, they 
think the presence of a coelom in both arthropods and vertebrates 
indicates a common ancestor that possessed a coelom (indicated by the 
solid horizontal line in 6.la). But advocates of the Ecdysozoa hypothesis 
(6.1b) think the coelom evolved at least twice independently (indicated 
by the two dashed horizontal lines in Figure 6.1b). They regard the 
presence of the coelom as a historically misleading similarity—one that 


does not indicate the presence of that feature in the most recent common 
ancestor of the groups possessing it. These two hypotheses and their 
implied histories are not congruent, and cannot both be true. 


Since the Ecdysozoa hypothesis was first proposed, other scientists 
have vigorously opposed it, reaffirming the Coelomata hypothesis, based 
on the analysis of other molecular evidence.’ Advocates of the 
Ecdysozoa grouping pushed back hard, however,** contending that, 
properly interpreted, available genetic evidence supports the Ecdysozoa, 
not the Coelomata hypothesis.’ 

My point in summarizing these disputes is simply to note that the 
molecular and anatomical data commonly disagree, that one can find 
partisans on every side, that the debate is persistent and ongoing, and 
that, therefore, the statements of Dawkins, Coyne, and many others 
about all the evidence (molecular and anatomical) supporting a single, 
unambiguous animal tree are manifestly false. As can readily be seen by 
comparing Figures 6.la and 6.1b,°° these hypotheses—Coelomata and 
Ecdysozoa—contradict each other. Although both might be false, both 
cannot be true. 

Various papers analyzing other groups have found similar 
discrepancies between molecular and morphological versions of the 
animal tree. A paper by Laura Maley and Charles Marshall in the journal 
Science noted, “Animal relationships derived from these new molecular 
data sometimes are very different from those implied by older, classical 
evaluations of morphology.”?’ For example, when tarantulas were used 
as the representative of arthropods, the arthropods were grouped more 
closely to mollusks than to deuterostomes (animals that develop anuses 
first and mouths later). This makes sense because both mollusks and 
arthropods are protostomes (animals that develop mouths first and 
anuses later). But when brine shrimp were used as the representative of 
the arthropods, the arthropods became the odd man out. Now mollusks 
were grouped most closely to deuterostomes, far away from arthropods 
—a result clearly at odds with the conventional phylogeny based upon 
anatomical characteristics.*® 


Examples of similar conflicts abound. The traditional phylogeny 
placed sponges at the bottom of the animal tree, with progressively more 
complex phyla (e.g., cnidarians, flatworms, nematodes) branching off. 
But Valentine, Jablonski, and Erwin note that molecules “indicate a very 
different configuration” of the tree, where some higher deuterostome 
phyla branch off very early and some comparatively less complex phyla 
branch very late.*? 

Likewise, morphological studies suggest phoronids (see Fig. 3.6) and 
brachiopods (see Fig. 1.3), both marine filter-feeding animals, are 
deuterostomes, but molecular studies classify them within 
protostomes.*? Morphological studies typically imply that sponges are 
monophyletic (all part of an exclusive branch on the tree of life) because 
of their distinctive body architecture, but molecular studies suggest that 
sponges don’t belong to a single unified group, with some sponges more 
closely related to jellyfish than they are to other sponges.*! Cnidarians 
and ctenophores have similar body plans, leading many to expect they 
were closely related on the basis of morphology. But molecular data 
have distanced these phyla significantly.4* As a major review article in 
Nature in 2000 observes, “Evolutionary trees constructed by studying 
biological molecules often don’t resemble those drawn up from 
morphology.’”*? And the problem isn’t getting better over time. A 2012 
paper admits that larger datasets are not solving this problem: 
“Incongruence between phylogenies derived from morphological versus 
molecular analyses and between trees based on different subsets of 
molecular sequences has become pervasive as datasets have expanded 
rapidly in both characters and species.””*4 

Indeed, the widespread discrepancies between molecular data and 
morphological data and between various molecule-based trees have led 
some to conclude that Pauling and Zuckerkandl were wrong to assume 
that the degree of similarity indicates the degree of evolutionary 
relatedness.*° As Jeffrey H. Schwartz and Bruno Maresca put it in the 
journal Biological Theory: “This assumption derives from interpreting 
molecular similarity (or dissimilarity) between taxa in the context of a 
Darwinian model of continual and gradual change. Review of the history 


of molecular systematics and its claims in the context of molecular 
biology reveals that there is no basis for the ‘molecular assumption.’ ””*° 


Anatomy vs. Anatomy 


Attempts to infer a consistent picture of the history of animal life based 
on analyzing the anatomical characteristics of different animals have 
also proven problematic. In the first place, there is a general and long- 
standing problem with attempts to infer the evolutionary history of the 
animal phyla from similar anatomical traits. At the level of the phyla— 
that is, when one compares the phyla to each other and tries to determine 
their branching order—the number of shared anatomical characteristics 
available for inferring evolutionary relationships drops off quite 
dramatically. There is an obvious reason for this. For example, an 
anatomical character such as the “leg,” that is useful for diagnosing and 
comparing arthropods, which possess legs, proves useless for making 
comparisons between (for example) brachiopods or bryozoans, which do 
not. In the same way, basic structural features of human-designed 
systems, such as the distinctive submarine “trait” of an encapsulating 
watertight hull, might help to distinguish it from a cruise ship, which is 
only watertight on its underside. But this “trait” would be irrelevant for 
comparing and classifying, say, suspension bridges, motorcycles, or flat- 
screen televisions. In a similar manner, biologists find that there are only 
a handful of highly abstract characters, such as radial versus bilateral 
body symmetry, the number of fundamental tissue layers (triploblasty, 
three layers, versus diploblasty, two layers), or the type of body cavity 
present (true coelom, pseudocoelom, or no coelom), available for 
morphological comparisons of the many diverse animal forms. Yet 
evolutionary biologists have often disputed the validity of these rather 
abstract traits as guides to evolutionary history.4’ In addition, just as 
trees based upon the analysis of different sets of similar genes or 
proteins often conflict, trees constructed on the basis of different 
developmental and anatomical characteristics often conflict. 


When biologists construct phylogenetic trees based upon anatomical 
characteristics, they typically group the animal phyla according to the 
presence or absence of several key characteristics. For example, the 
standard version of the animal tree, based upon anatomy, groups animals 
according to their style of body plan symmetry and by their mode of 
body plan development. As noted earlier, animals with mirror symmetry 
along their vertical head-to-tail axes all fall within the Bilaterian group. 
Animals with radial symmetry (or no symmetry) fall outside this group. 
Within the Bilateria, taxonomists distinguish other main groups— 
protostomes and deuterostomes—based upon their differing modes of 
body plan development—1.e., either “mouth first” or “anus first.” 

Yet a significant difficulty arises when evolutionary biologists 
consider how a particularly fundamental characteristic—the mode of 
germ-cell formation—is distributed among various groups on the 
canonical animal tree of life (see Fig. 6.2).48 Germ cells produce eggs 
and sperm (in any sexually reproducing species) or gametes (in any 
asexually reproducing species), giving rise to the next generation.*” 
Animals have two main ways of generating germ cells. In one mode of 
germ-cell formation, known as preformation, cells inherit internal 
signals from a region within their own cell structure to become germ 
cells (illustrated by solid black squares on Fig. 6.3a). In the other main 
way of generating germ cells, known as epigenesis, germ cells receive 
external signals from surrounding tissues to become primordial germ 
cells (PGCs, illustrated by solid white squares in Fig. 6.3b). 

Germ-cell formation has indisputable evolutionary importance. To 
evolve, a population or a species must leave offspring; to leave 
offspring, species of animals must generate primordial germ cells. No 
PGCs, no reproduction; no reproduction, no evolution. 

One might expect, therefore, that if a group of animals is all derived 
from a common ancestor (with a particular mode of gamete production), 
then the mode of germ-cell formation should also be essentially the same 
from one animal species to the next in that group. Further, assuming the 
common ancestry of all animals, our expectation of homologous modes 
of germ-cell formation among the animals ought to be higher than for 


any other tissue type, cell line, or mode of development. Why? Because 
mutations affecting the developmental mechanisms that govern PGC 
formation inevitably disrupt successful reproduction.°? Again, if a 
species cannot reproduce, it cannot evolve.>! 

Thus, similar groups of animals—indeed, all animals, if they have 
descended from a common ancestor—ought to exhibit the same basic 
mode of germ-cell formation. Further, that the evolutionary tree derived 
from an analysis of the “mode of germ-cell formation” ought to be 
congruent with the trees derived from other such fundamental 
characteristics (such as body-plan symmetry, mode of development, 
number of primary tissues, and so forth). 

But the mode of germ-cell formation is nearly randomly distributed 
among the different animal groups, making it impossible to generate a 
coherent tree based on this characteristic, let alone making any 
comparison between such a tree and the canonical tree. Note also the 
distribution of the two basic modes of germ-cell development within the 
animal phyla as depicted on the canonical tree. Figure 6.4, derived from 
the work of Harvard developmental biologist Cassandra Extavour,>* 
shows this distribution and provides another way of understanding the 
incongruence that arises when analyzing different anatomical 
characteristics. 
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FIGURE 6.2 
The canonical tree of the Metazoa as determined by the analysis of 
selected anatomical characters and genes. 





A. PREFORMATION B. EPIGENESIS 


FIGURE 6.3 

Two modes of germ-cell formation. Figure 6.3a (left): In fruit flies as 
the egg is being formed, the mother’s nurse cells (the four cells on the 
left of the oval-shaped egg chamber, indicated by large black circles) 
deposit proteins and RNAs, which are transported to the posterior pole 
of the egg (indicated by the dark patch to the right of the large cell on the 
right). These maternally synthesized molecules then trigger the 
development of the germ cells and sex organs of the fly during 
embryogenesis. Figure 6.3b (right): In the eggs of mice there are no 
maternally deposited products that determine germ-cell formation. 
Rather, as the embryo develops a subpopulation of cells (represented by 
the triangles on the left) express “germline competence genes.” These 
cells then “read” signals that arrive from other tissues (see arrows), 
causing the cells to differentiate into primordial germ cells (as indicated 
by the stars on the right). 


Notice the two modes of germ-cell formation do not cluster together 
in separate parts of the canonical tree. Instead, they are distributed 
haphazardly among various phyla on different branches of the tree. In 
the protostomes, for instance, modes of germ-cell formation wink on and 
off between preformation and epigenesis. The same is the case within 


the deuterostomes: germ-cell formation varies almost randomly, and 
several groups exhibit both modes, rendering it difficult or impossible to 
determine which characteristic was present at different ancestral 
branching points. Noting this pattern of distribution, Cassandra Extavour 
concludes that “the data presently available cannot suggest homologies 
of the somatic components of metazoan gonads.”> 
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FIGURE 6.4 

The distribution of modes of primordial germ-cell formation (epigenesis, 
preformation, or both) among various animal groups. The solid thin lines 
between the boxes on the right and the phyla names on the left show 
where the different modes of germ-cell formation are present in different 
phyla. The nearly random distribution of types of germ-cell formation 
among the various animal groups makes it impossible to generate an 
animal phylogeny (evolutionary history) based upon this character that 
will match the evolutionary history implied by the canonical animal tree 
of life. 
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FIGURE 6.5 

A selection of incompatible (mutually incongruent) phylogenetic trees 
representing the history of the major groups of animals, drawn from the 
zoological and evolutionary literature (1940-present). Note: The 
definitions of some taxonomic groups in some of these phylogenetic 
trees may have changed significantly since the time those phylogenies 
were originally constructed. Branch lengths may not always be drawn to 
scale. 


After completing a survey of many such difficulties, University of St. 
Andrews zoologist Pat Willmer and Oxford University zoologist Peter 
Holland, experts on invertebrate anatomy, draw this conclusion: “Taken 
together, modern re-evaluations of traditional evidence support different 
and mutually exclusive subsets of [phylogenetic] relations.”°+ They go 
on to observe that “patterns of symmetry, the number of germ layers in 
the body, the nature of the body cavity, and the presence or type of serial 
repetition [segmentation] have all been used to infer common ancestry.” 
But, they explain, the phylogenetic story these characteristics tell is 
“now either unacceptable or at least controversial” because the data are, 
at best, inconsistent.>> 

The historical record of ongoing uncertainty about the animal tree of 
life since 1859 confirms, as one respected textbook on invertebrate 
animals explains, that “phylogenetic analysis at the level of the phyla is 
highly problematical.” As a result, “the study of higher level animal 
phylogeny has yielded an expansive literature but relatively little 
detailed consensus. . . . In point of fact, there exists no such thing as ‘the 
traditional textbook phylogeny.’ A diversity of different schemes can be 
found.”>’ To appreciate this problem visually, look at Figures 6.la and 
6.lb as well as Figure 6.5. These show some of the many metazoan 
phylogenies, based on anatomy, published in the twentieth century. 
These branching patterns plainly do not agree with one other. 


THE ASSUMPTIONS OF PHYLOGENETIC INFERENCE 


All these problems underscore several fundamental difficulties with the 
methods of phylogenetic reconstruction. When biologists analyze 
multiple anatomical traits or genes, the animal phyla consistently defy 
attempts to arrange them into the pattern of a single tree. Yet if there was 
a period of hidden Precambrian evolution, and if comparative sequence 
analyses reveal the actual history of animal life and, by implication, the 
existence of Precambrian animal forms, phylogenetic studies should 
converge more and more around a single tree of animal life. Just as only 
one possible divergence point could represent the event at which animal 
forms began to evolve from a common animal ancestor, only one of the 
many trees produced by phylogenetic analysis can represent the actual 
Precambrian history of animal life. If, instead, phylogenetic analyses 
consistently generate different possible evolutionary histories, it’s 
difficult to see how any one of them could be known to be sending a 
reliable historical signal. Again, the history of animal life only happened 
once. 

One could argue that these conflicting trees do, at least, show that 
some tree-like evolutionary pattern of common ancestry preceded the 
Cambrian, since all conflicting trees do affirm that. But, again, they all 
“show” this because they all presuppose it, not because they demonstrate 
it. 


CONVERGENT EVOLUTION 


There is yet another reason to wonder whether studies of anatomical or 
molecular homology convey anything definitive about the history of life. 
Many animals have single traits or features in common with otherwise 
decidedly different animals. In such cases, it makes no evolutionary 
sense to classify these forms as closely related ancestors. For example, 
moles and mole crickets have remarkably similar forelimbs, though 
moles are mammals and mole crickets are insects. No evolutionary 
biologist regards these two animals as closely related, for understandable 
reasons. 


The theory of universal common descent assumes that, generally, the 
more similar two organisms are, the more closely related they must be. 
Assuming common descent, animals with wildly differing body plans 
should not be closely related. The presence of nearly identical individual 
traits or structures within organisms exemplifying otherwise different 
body plans cannot, therefore, be attributed to evolution from a common 
ancestor. Instead, evolutionary biologists attribute similar traits or 
structures in such a context to so-called convergent evolution, the 
separate or independent origin of similar characters emerging on 
separate lines of descent after the point at which those lines diverged 
from their last common ancestor. Convergent evolution demonstrates 
that similarity does not always imply homology, or inheritance from a 
common ancestor. 

For this reason, the repeated need to posit convergent evolution (and 
other related mechanisms)°® casts further doubt on the method of 
phylogenetic reconstruction. Invoking convergent evolution negates the 
very logic of the argument from homology, which affirms that similarity 
implies common ancestry, except—we now learn—in those many, many 
cases when it does not. Repeatedly invoking convergence negates the 
assumption that justified the method of phylogenetic reconstruction in 
the first place, namely, that similarity is a reliable historical signal of 
common ancestry. 


A FAMILY REUNION? 


So what lesson should we draw from these many conflicting trees? 
Clearly, these contradictory results call into question the existence of a 
canonical tree of animal life. To see why, imagine being invited to an 
event billed as an extended-family reunion where you’ve been told you 
will encounter hundreds of your relatives, most of whom you have never 
met. Let’s call the description in the invitation the “reunion hypothesis.” 
The invitation says that a group photograph is planned, for which 
relatives will be grouped together according to their degree of 
relationship (first cousins with first cousins, and so on). 


You show up and grab a coleslaw, eager to meet these many 
previously unknown relatives. You see the hundreds gathered and have 
every reason to believe that the “reunion hypothesis” is true. After all, 
the invitation in your mailbox described the event as a family reunion. 

As the day goes on, however, something seems amiss. Here and 
there, you see familiar facial features—“Yes,” you think, that person 
could be my cousin’—but the majority of attendees and all the strangers 
you engage in conversation exhibit no discernible family resemblances. 
Nor does anyone seem to share any personal relationships with anyone 
else at the reunion, no matter how long they chat and try to establish 
points of commonality. What’s more, each person tells a different story 
of his or her family history. You try to group the strangers by physical 
characteristics (height, hair color, body type, and so on), but the 
characteristics that you find fail to yield evidence of family ties or 
genealogical connections. Whatever points of commonality you find 
refuse to assemble into a coherent, consistent story. The pedigrees are 
unclear. The reunion hypothesis is under major strain. 

When the photographer arrives, she gamely tries to draw everyone 
together for the planned photograph. Chaos ensues. No one knows 
where to stand, because the family relationships are so unclear, if they 
exist at all. After milling around hopelessly for half an hour or so, people 
depart for their cars, wondering why they bothered to attend the picnic, 
and indeed why they were invited in the first place. 

Do you now have good reasons to doubt the “reunion hypothesis”? 
You do. If the family reunion hypothesis were true, it would have been 
increasingly confirmed, because the evidence would have converged on 
a single consistent pattern. Had there been a true pattern of family 
relationships, the longer everyone talked, the more a single coherent 
pattern of relationship would have become apparent. But the people at 
the picnic were not your relatives, at least not in any way you could 
determine to be true. Instead of converging on a single pattern, the 
“evidence” was all over the map. 


A FOREST OF TREES 


Of course, my family reunion illustration breaks down as an analogy to 
the history of animal life, because if we could trace the history of all the 
people at the reunion back far enough we would find that they are all 
related by common ancestry. Though we can choose to assume that the 
same is true of the Cambrian animals, neither the fossil evidence nor the 
evidence of genetics and comparative anatomy actually establishes that. 
These three classes of evidence either provide no compelling evidence 
for Precambrian animal ancestors (in the case of fossils), or they provide 
question-begging and conflicting evidence (in the case of genes and 
anatomy). 

And that is the point of my story. Since there can be only one true 
history of the Cambrian animals, the evidence should converge on a 
common family tree—if indeed we are looking at evidence of true 
history. The picture given by the evidence should be stable, not 
constantly changing. But the evidence from a variety of quarters has 
instead continually generated new, conflicting, and incoherent pictures 
of the history of animal life. As with the “cousins” in my illustration, 
there seems to be no consistent and coherent way to organize the animal 
groups into a family tree. 

But if the genes don’t tell the story of Precambrian ancestral forms, if 
they don’t compensate for a dearth of fossil evidence and establish a 
unequivocal long cryptic history of animal life from an original animal, 
an ur-metazoan, then logically we are back to taking the fossil record at 
face value. In that case, the mystery of the missing ancestral fossils 
remains. If so, is there any way to explain the abrupt appearance of new 
forms of life in the fossil record within an evolutionary framework? 
During the 1970s, two young paleontologists thought that there just 
might be a way to do exactly that. 


7 
Punk Eek! 


Scientific discoveries are rarely made in Laundromats, but at least one 
great scientific breakthrough—an “aha” moment—occurred in one. The 
year was 1968, more than a decade before the discovery of the first 
Chengjiang fossils. The scientist overtaken by the muses was 
paleontologist Niles Eldredge. One day while standing in a Michigan 
Laundromat, following months of collecting trilobite fossils for his 
Ph.D. research, Eldredge happened to reach into his pocket. He removed 
one of the fossils he had been collecting, a specimen of a trilobite 
species called Phacops rana. Initially, as he examined the specimen, he 
felt “depressed.” The fossil closely resembled many others that he had 
found across layers of strata during his fieldwork in the Midwest. His 
trilobites showed no evidence of gradual change, as classical neo- 
Darwinism had taught him to expect. ! 

As Eldredge explained in a lecture at the University of Pittsburgh in 
1983, he then experienced a kind of scientific epiphany. He realized that 
the “absence of change itself” was “a very interesting pattern.” Or as he 
later put it, “Stasis is data.”? “Stasis” is the term that Eldredge and his 
scientific collaborator, Stephen Jay Gould (see Fig. 7.1), later gave to 
the pattern in which most species, “during their geological history, 
either do not change in any appreciable way or else they fluctuate 
mildly in morphology, with no apparent direction.”? As Eldredge 
examined that solitary trilobite, he realized that he had been observing 
evidence of stasis for some time—however much he might have wanted 
it otherwise. As he explained, “Stasis ... was by far the most important 


pattern to emerge from all my staring at Phacops specimens.” He 
continued, “Traditionally seen as an artifact of a poor record, as the 
inability of paleontologists to find what evolutionary biologists going 
back to Darwin had told them must be there, stasis was, as Stephen Jay 
Gould put it, ‘paleontology’s trade secret-—an embarrassing one at 
that.” 

This embarrassing realization proved pivotal, eventually leading 
Eldredge and Gould to reject both the gradualistic picture of 
evolutionary change articulated by Darwin and the neo-Darwinian 
understanding of the mechanism by which such change allegedly takes 
place. It also led them to formulate, in a series of scientific papers from 
1972 to 1980, a new theory of evolution known as “punctuated 
equilibrium.” 

As a consequence of this theory, neither Gould nor Eldredge 
expected to find a wealth of transitional intermediate forms in the fossil 
record. In their view, the main periods of biological innovation simply 
occurred too rapidly to leave many fossil intermediates behind.° 

Gould and Eldredge sought to explain the occurrence of the rapid 
periods’ of change (i.e., the punctuations) as the by-product of different 
kinds of evolutionary mechanisms or processes of change. They 
proposed, first, a mechanism called “allopatric speciation” to explain 
the rapid generation of new species. Gould, Eldredge, and another early 
advocate of punctuated equilibrium named Steven Stanley, a 
paleontologist at Johns Hopkins University, also proposed that natural 
selection operated on higher levels. Rather than natural selection 
favoring the fittest individual organisms within a species—as it does in 
classical Darwinism and neo-Darwinism—these paleontologists 
proposed that it often selected the most fit species among a group of 
competing species. Because they thought that speciation occurred more 
rapidly, and because they thought that natural selection acted on whole 
species and not just individual organisms, the advocates of punctuated 
equilibrium theorized that morphological change typically occurs in 
larger, more discrete jumps than Darwin first envisioned. 





FIGURE 7.1 

Figure 7.la (left): Stephen Jay Gould. Courtesy Getty Images. Figure 
7.1b (right): Niles Eldredge. Copyright © Julian Dufort 2011. Used 
with permission. 


Thus, in one sense, the theory of punctuated equilibrium, like the 
artifact hypothesis, sought to explain the absence of the transitional 
intermediate forms that were expected based on Darwin’s theory. By 
repudiating Darwinian gradualism, the advocates of punctuated 
equilibrium sought to account for the absence of transitional forms in 
the fossil record apart from the artifact hypothesis or, at best, using what 
they imagined as a more modest version of it. But in repudiating 
Darwinian gradualism, punctuated equilibrium also represented a 
radically different view of the pace and mode of evolution—a new 
theory of evolution that purported to identify a new mechanism of 
evolutionary change. As historian of science David Sepkoski explains, 
“Gould and Eldredge proposed a radical revision of this standard [ neo- 
Darwinian] narrative. They argued that the pattern of evolutionary 
history really was composed of fits and starts, consisting of long periods 


of evolutionary stasis (or ‘ equilibrium’) ‘ punctuated’ by shorter periods 
of rapid speciation.”® 

During the 1970s and 1980s, the theory of punctuated equilibrium, or 
“punk eek” as it is affectionately known, generated both intense 
scientific debate and extensive media coverage.’ Critics called the 
model “evolution by jerks,” leading Gould to reply that proponents of 
gradualism were offering “evolution by creeps.”!° Though initially 
Eldredge played more of a role in formulating the theory, Stephen Jay 
Gould emerged as the leading spokesman for it. As a result of his 
advocacy of the theory as well as his popular science writing, Gould 
achieved immense celebrity—celebrity that has in turn secured an 
enduring place for punctuated equilibrium in scientific awareness. 

So what has become of this bold scientific proposal? Does 
punctuated equilibrium solve the problems that traditional neo- 
Darwinism does not? Does it help explain the Cambrian explosion and 
the missing fossil intermediates that render it so mysterious? 


WANTED: FAST ENGINE 


Once they decided to take the fossil record at face value, the question 
for Gould and Eldredge was obvious: What could generate evolutionary 
change so rapidly? To explain the short bursts or punctuations, Gould 
and Eldredge proposed a mechanism of rapid speciation to which 
Stanley added (with their agreement) a new understanding of the 
mechanism of natural selection.!! 

Whereas the neo-Darwinian mechanism of natural selection acting 
on random mutations necessarily acts slowly and gradually, Gould and 
Eldredge invoked a process called “allopatric speciation” to explain 
how new species might arise quickly. The prefix allo means “other” or 
“different,” and the suffix patric means “father.” Thus, allopatric 
speciation refers to processes that generate new species from separate 
parent (or “father”) populations. Allopatric speciation typically occurs 
when part of a population of organisms becomes geographically isolated 
—perhaps by the emergence of a mountain range or the shifting of a 


river’s course—from a larger parent population and then a daughter 
population changes in response to differing environmental pressures. 

Gould and Eldredge drew on insights from population genetics to 
explain why new genetic traits were more likely to spread and establish 
themselves within these smaller subpopulations. Population genetics, a 
subject to which I'll return in Chapter 12, describes the processes by 
which genetic traits change and become fixed in a population of 
organisms. It teaches that in typically /arge populations of organisms, it 
is difficult for a newly arising genetic trait to spread throughout an 
entire population. Yet for any evolutionary change to occur in a 
population, new genetic traits must become widespread, or “fixed,” by a 
process called “fixation.” 

In smaller populations, however, the probability of a newly arising 
trait becoming fixed is much higher, since the new trait needs to spread 
to fewer organisms. By way of illustration, consider a bag containing 
fifty red and fifty blue marbles. Suppose that, by removing individual 
marbles at random, you seek to change the “population” of mixed-color 
marbles to one in which all the marbles are red. To produce a 
completely red “species,” we must generate a population in which all 
the blue marbles have been eliminated. If someone randomly picks half 
of the one hundred marbles out of the bag, it is extremely improbable 
that all of the marbles thus eliminated will be of just one color. Indeed, 
there is less than 1 chance in 10%° that all of the marbles selected for 
removal will be blue.!* Conversely, there is an extremely high 
probability that the remaining batch will still include both blue and red 
marbles. 

In a much smaller group of, say, eight marbles, evenly divided 
between four red and four blue, the probability of selecting four blue 
marbles at random and leaving only red marbles, though unlikely, is not 
prohibitively small. There’s now a much higher chance—1 in 70—that 
the remaining population of marbles will be all red.!* By starting with a 
smaller number of marbles, the probability that random selection will 
result in a population of uniform color is much higher. In a similar way, 


the probability of fixing a genetic trait in a population of organisms 
decreases exponentially with the size of the population. 

In formulating punctuated equilibrium, Gould realized that new 
species would inevitably have to arise in smaller populations, where 
random processes could have a greater chance of fixing traits. 
Prominent among those random processes is one called genetic drift. 
This occurs when genetic changes spread or disappear randomly 
through a population, without regard for their effect on survival and 
reproduction. 

In Gould and Eldredge’s view, allopatric speciation helped to explain 
how evolution could occur in larger, more discrete jumps than 
Darwinian gradualism predicts (see Fig. 7.2). As allopatric speciation 
occurs, it can generate what Gould and Eldredge conceived as sibling or 
offspring species. They thought that the processes that drive these 
speciation events occur relatively quickly in smaller populations, thus 
helping to explain the sudden jumps in the fossil record. As they put it: 
“Small numbers and rapid evolution virtually preclude the preservation 
of speciation events in the fossil record.”'* As they envisioned the 
evolutionary process working, the branches on the tree of life would 
split off so abruptly that they would appear as virtually “horizontal” 
lines, producing sudden discontinuities in the fossil record and therefore 
fewer fossilized intermediates. Eldredge and Gould explained it this 
way: “The theory of allopatric (or geographic) speciation suggests a 
different interpretation of paleontological data. If new species arise very 
rapidly in small, peripherally isolated populations, then the expectation 
of insensibly graded fossils is a chimera. A new species does not evolve 
in the area of its ancestors; it does not arise from the slow 
transformation of all its forbears.” Thus, they concluded, “Many breaks 
in the fossil record are real.”!° 
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FIGURE 7.2 

Two views of the history of life. Figure 7.2a (left): The traditional 
Darwinian picture showing slow, gradual change. Figure 7.2b (right): 
The history of life as depicted by the theory of punctuated equilibrium 
showing rapid speciation. 


Gould, Eldredge, and Stanley thought that members of these sibling 
or offspring species would, subsequent to their origin by allopatric 
speciation, compete against each other for resources and survival, just 
as, in neo-Darwinism, individual organisms or siblings may compete to 
survive and reproduce within a population. In their view, if members of 
one species succeed over another because of some selective advantages 
they possess, then that species will survive and predominate, passing on 
its traits. This process of interspecies or interpopulation competition (as 
opposed to intraspecies competition) Gould called “species selection.”!® 

As Gould himself explained: “I propose, as the central proposition of 
macroevolution, that species play the same role of fundamental 
individual that organisms assume in microevolution. Species represent 
the basic units in theories and mechanisms of macroevolutionary 
change.”!” Since natural selection then would act upon large differences 
in overall biological form—differences between whole species as 


opposed to individuals within species—evolutionary change would take 
place in bigger, more discrete jumps.!® 

Gould and Eldredge thus did not expect the fossil record to document 
many intermediates. Instead, they thought the “gaps” in the fossil record 
were “the logical and expected result of the allopatric model of 
speciation” as well as the closely related mechanism of species 
selection.!? Species selection made “the species” the unit of selection; 
the allopatric model of speciation asserted that new species quickly arise 
from smaller populations of organisms. Both mechanisms implied that 
fewer fossil intermediates would be preserved. According to punctuated 
equilibrium, the long “missing” transitional intermediates are not 
missing after all. In the process of species selection, the species, rather 
than the individual organism, competes for survival and thus becomes— 
in the jargon of evolutionary biology—the main “unit of selection” in 
macroevolution. 


“PUNK EEK” AND THE FOSSIL RECORD 


Eldredge and Gould devised the theory of punctuated equilibrium to 
eliminate the conflict between the fossil record and evolutionary theory. 
Nevertheless, punctuated equilibrium has its own problems accounting 
for the fossil record. In particular, the pattern of fossil appearance in the 
Cambrian period is inconsistent with both the way in which punctuated 
equilibrium depicts the history of life and with the idea that allopatric 
speciation and species selection are responsible for that pattern. There 
are several reasons for this. 

First, the top-down pattern of appearance of Cambrian animal forms 
that we saw in Chapter 2 contradicts punctuated equilibrium’s depiction 
of the history of life almost as much as it does the Darwinian picture 
(see Fig. 2.11). Recall that Darwin thought that the first representatives 
of the higher taxonomic categories emerged after the first appearance of 
representatives of each of the lower level taxa—that small differences 
distinguishing, for example, one species from another should gradually 
accumulate until they produced organisms different enough to be 


classified, first, as different genera, then, as different families, and 
eventually as different orders, classes, and so on. Instead, the first 
Cambrian animal forms are different enough from each other to justify 
classifying them as separate classes, subphyla, and phyla from their first 
appearance in the fossil record (see Fig. 7.3). 

This pattern creates an acute difficulty for the theory of punctuated 
equilibrium., First, due to the action of allopatric speciation and species 
selection, advocates of punctuated equilibrium envision morphological 
change (represented as horizontal distance in Figure 7.2) arising in 
larger, more discontinuous increments of change. Nevertheless, like 
neo-Darwinists, they too see phyla-level differences arising from the 
“bottom up,” starting with lower level taxonomic differences—albeit 
occurring in increments involving whole new species rather than 
individuals or varieties within species. Indeed, according to the theory 
of punctuated equilibrium, allopatric speciation first produces new 
species in smaller geographically isolated populations. For 
representatives of higher taxonomic categories to arise, these new 
species must accumulate new traits and evolve further. For this reason, 
punctuated equilibrium also expects small-scale diversity and 
differentiation of new species to precede the emergence of larger-scale 
morphological disparity and taxonomic differences. It also expects a 
“bottom-up” rather than a “top-down” pattern of appearance (see Fig. 
7.3). 

Second, for species selection to produce many new species, such as 
those that arise in the Cambrian explosion, a large pool of different 
species must first exist. The Precambrian fossil record does not 
document, however, the existence of such a large and diverse pool of 
competing Precambrian species upon which species selection (via 
allopatric speciation) might operate. 

Paleontologists Douglas Erwin and James Valentine exposed this 
problem in 1987 in a seminal paper titled “Interpreting Great 
Developmental Experiments: The Fossil Record.”?? They questioned 
the ability of both of the main evolutionary theories of the time— 
punctuated equilibrium and neo-Darwinism—to explain the pattern of 


fossil appearance in the Precambrian—Cambrian fossil record.*! Clearly, 
neo-Darwinism does not explain this pattern. But, as Valentine and 
Erwin argue, neither does punctuated equilibrium. As they conclude, the 
mechanism of species selection requires a large pool of species upon 
which to act. Thus, Valentine and Erwin conclude: “The probability that 
species selection is a general solution to the origin of higher taxa is not 
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This figure shows that the theory of punctuated equilibrium (on /eft), 
like neo-Darwinism, anticipates small-scale diversity preceding large- 
scale disparity in form in contrast to the pattern in the fossil record 
(shown on right). Punctuated equilibrium also anticipates a “bottom- 
up,” rather than a “top-down,” pattern. 


The late-Precambrian and Cambrian fossil records present another 
difficulty for punctuated equilibrium. Though Gould and Eldredge 
envisioned new traits becoming fixed in small isolated populations 
where speciation eventually occurs, they envisioned these traits first 
arising during periods of stasis in the large populations from which the 
smaller populations later separated. Gould realized that only stable large 


populations would afford enough opportunities for mutations to 
generate the new traits that macroevolution requires.” At the same 
time, he recognized that these new traits would have a far greater 
chance of being fixed into small, isolated populations where the random 
loss of some traits makes the fixation of others more likely (recall the 
marble example). By relying on large populations to generate new traits 
and small populations to fix them throughout a population, Gould 
wanted to provide both a plausible (if finely tuned) mechanism to 
explain both macroevolutionary change and the absence of fossil 
intermediates.** The late University of Chicago paleontologist Thomas 
J. M. Schopf described the balance this way, under punctuated 
equilibrium, evolution proceeds “in populations large enough to be 
reasonably variable, but small enough to permit large changes in gene 
frequencies due to random drift.””° 

But by relying on the accumulation of new traits within large parent 
populations, Gould undercut his own rationale for concluding that the 
fossil record should not preserve many intermediate forms. The reason 
for this is obvious: if novel genetic traits arise and spread within a large 
population of organisms, they are more likely to leave behind fossil 
evidence of their existence. Organisms with new and unique 
combinations or mosaics of traits represent nothing less than new forms 
of life. Thus, the process by which Gould envisions new genetic traits 
arising in large populations implies that new forms of life—some 
presumably transitional to other forms—should be preserved in the 
fossil record. Yet the Precambrian fossil record fails to preserve such a 
wealth of biological experiments during the long periods of relative 
stability in large populations that Gould’s theory envisions. 


THE TESTIMONY OF STATISTICAL PALEONTOLOGY 


Studies in statistical paleontology have raised additional questions about 
whether the fossil record documents enough transitional intermediates 
to render punctuated equilibrium credible. In Chapter 3, I discussed the 
work of the statistical paleontologist Michael Foote. Foote used 


sampling theory to argue that the fossil record provides a reasonably 
complete picture of the forms of life that have existed on earth and to 
suggest that paleontologists are unlikely to find the many intermediate 
forms that neo-Darwinian theory requires. 

Foote has also analyzed the question of whether the fossil record 
documents the number of intermediate forms that punctuated 
equilibrium requires. His answer: it depends. 

Foote notes that whether a particular version of evolutionary theory 
can account for patterns in the fossil record depends upon the kind of 
mechanism of change it invokes. Neo-Darwinism relies upon a slow, 
gradually acting mechanism of change and, thus, has difficulty 
accounting for evidence of sudden appearance. Foote analyzes whether 
the number of proposed transitional species in the fossil record is 
consistent with punctuated equilibrium and concludes that it depends 
upon how fast the mechanisms upon which it relies can generate new 
forms of life. Though its proponents envision the evolution of new 
forms of life arising more abruptly than do the neo-Darwinists, they 
would still expect that the fossil record should have preserved some 
transitional fossils. Does the fossil record preserve even the relatively 
few intermediate forms that the theory of punctuated equilibrium 
implies that it should? 

To answer that question, Foote developed a statistical method of 
testing the adequacy of different evolutionary models against several 
variables.*° He observed that for punctuated equilibrium to succeed as 
an explanation for the data of the fossil record, it needs a mechanism 
capable of producing major evolutionary change quickly, because only 
such fast-acting change could account for the relative paucity of 
transitional forms in the fossil record. As Foote explained (writing with 
Gould in fact), the adequacy of punctuated equilibrium as an account of 
the fossil record depends upon the existence of a mechanism “of 
unusual speed and flexibility.”/ 

But does the theory identify such a mechanism? 


FROM WHENCE NEW TRAITS AND FORM? 


Neither allopatric speciation nor species selection can generate the new 
genetic and anatomical traits necessary to produce animal forms, let 
alone in the relatively brief time of the Cambrian explosion. As 
conceived by Gould and the other advocates of punctuated equilibrium, 
allopatric speciation just allows for the possibility of the rapid fixation 
of preexisting traits, not the generation of new traits. When a parent 
population splits into two or more daughter populations, each of the 
daughter populations retains a part, but usually not the whole, of the 
gene pool of the original population. No new genetic traits are generated 
by the geographical isolation of one part of a population from another. 

It could be argued, of course, that mutations might occur during the 
process of speciation, thus generating new genetic traits. But as Gould 
and Eldredge conceived of it, allopatric speciation occurs much too 
rapidly to have a reasonable chance of mutations generating anything 
fundamentally new. Darwin recognized in On the Origin of Species that 
evolution is a numbers game: larger population sizes and more 
generations offer more opportunities for favorable new variations to 
arise. As he explained: “Forms existing in larger numbers will always 
have a better chance . . . of presenting further favourable variations for 
natural selection to seize on, than will the rarer forms which exist in 
lesser numbers.”*® Yet for the mechanism of allopatric speciation to 
generate new traits, it would need to generate significant changes in 
form in small “peripherally isolated” populations over relatively few 
generations.?? Because of these constraints, many biologists have 
concluded that allopatric speciation requires too much change too 
quickly to provide the theory of punctuated equilibrium with a 
biologically plausible mechanism for producing new traits or forms of 
animal life. 

And that is why Gould and Eldredge, especially in their later 
formulations of the theory, envisioned new traits arising during long 
periods of stasis in larger populations rather than during short bursts of 
speciation. But a process in which traits arise “during long periods of 
stasis” does not constitute a “mechanism of unusual speed and 
flexibility,” though that is precisely what, according to Gould and Foote, 


punctuated equilibrium requires in order to explain the abrupt 
appearance of new animal forms. 

If allopatric speciation does not produce a fast-acting trait-generating 
mechanism, does species selection? Again, the answer is no. Species 
selection does not account for the origin of the different anatomical 
traits that distinguish one species from another. Species selection, as 
conceived by the proponents of punctuated equilibrium, acts on species 
and traits that already exist. Indeed, when Stanley, Gould, and Eldredge 
envisioned natural selection acting to favor the most fit species over 
another in a competition for survival, they presupposed the existence of 
a pool of different species and, therefore, also the existence of some 
mechanism for producing the traits that characterize those different 
species. That mechanism, however, would necessarily need to generate 
those differentiating traits before species could enter into competition 
with each other. Species selection eliminates less fit species in a 
competition for survival; it does not generate the traits that distinguish 
species and establish the basis for interspecies competition. 

So where do these traits come from? When pressed, Gould 
eventually acknowledged that the origin of anatomical traits themselves 
result from good, old-fashioned natural selection acting on random 
mutations and variations—that is, from the neo-Darwinian mechanism 
acting over long periods of time on large relatively stable populations. 
But that meant that punctuated equilibrium, to the extent it relies on 
mutation and natural selection, is subject to the same evidential and 
theoretical problems as neo-Darwinism. And one of those problems is 
that the neo-Darwinian mechanism does not act quickly enough to 
account for the explosive appearance of new fossil forms in the 
Cambrian period. Like allopatric speciation, species selection does not 
qualify as the kind of rapid and flexible mechanism that Gould 
elsewhere insisted his theory must have in order to explain the abrupt 
appearance of animal forms in the fossil record. 


NOVEL FORM AND MECHANISM 


An even more profound difficulty with punctuated equilibrium as an 
explanation for the Cambrian explosion remains. Neither species 
selection nor allopatric speciation explains the origin of the 
representatives of the higher taxonomic categories—that is, the new 
animals representing new phyla and classes. Nor does it explain the 
structural and morphological features that distinguish animals from one 
another and earlier forms of life. Allopatric speciation explains how 
populations get separated from each other to form different species. 
Species selection describes how more fit species predominate over other 
species in a competition for survival. Neither mechanism gives any 
account of how the animals representing the specifically higher taxa or 
their distinctive anatomical novelties arose. Neither mechanism 
accounts, for example, for the origin of the compound eye of a trilobite, 
nor the gills of a Cambrian fish,*? nor the echinoderm body plan. 

Many critics of punctuated equilibrium have noted this problem. As 
Richard Dawkins wrote in 1986: “What I mainly want a theory of 
evolution to do is explain complex, well-designed mechanisms like 
hearts, hands, eyes and echolocation. Nobody, not even the most ardent 
species selectionist, thinks that species selection can do this.”?! Or as 
paleontologist Jeffrey Levinton argued in 1988, “It is inconceivable how 
selection among species can produce the evolution of detailed 
morphological structures. . . . Species selection did not form an eye.”?? 

So where do these intricate structures come from? Again, when 
pressed, Gould resorted to the alleged power of the neo-Darwinian 
mechanism. As he wrote in his magisterial tome The Structure of 
Evolutionary Theory, published in 2002, the year of his death: “I do not 
deny either the wonder, or the powerful importance, of organized 
adaptive complexity.” He went on to concede, “I recognize that we 
know no mechanism for the origin of such organismal features other 
than conventional natural selection at the organismic level.”*° 

For this reason, few if any evolutionary biologists now regard 
punctuated equilibrium as a solution to the problem of the origin of 
biological form and novelty. As the evolutionary biologists Brian 
Charlesworth, Russell Lande, and Montgomery Slatkin have concluded, 


“genetic mechanisms that have been proposed [by proponents of 
punctuated equilibrium] to explain the abrupt appearance and prolonged 
stasis of many species are conspicuously lacking in empirical 
support.”>4 


BURST OF INTEREST AND GRADUAL DECLINE 


Still, it may not be entirely fair to criticize punctuated equilibrium for 
failing to account for the Cambrian explosion. Gould, in particular, 
equivocated about whether he meant punctuated equilibrium to serve as 
a comprehensive theory of macroevolutionary change or just an account 
of how new species emerged from a pool of preexisting species. Strictly 
speaking, the mechanisms of allopatric speciation and species selection 
sought to explain the pattern of stasis and discontinuity among different 
species and not among the higher taxa. Thus, near the end of his career, 
Gould complained about his critics “misunderstanding” his theory by 
asserting that he “proclaimed the total overthrow of Darwinism” and 
“intended punctuated equilibrium as both the agent of destruction and 
the replacement.» 

Yet Gould and Eldredge, at least initially, advanced punctuated 
equilibrium as a bold new theory of evolutionary biology, giving the 
impression that it provided an ambitious solution to the problem of 
macroevolution and, by implication, events such as the Cambrian 
explosion. From 1972 to 1980, Eldredge and Gould presented a series of 
provocative scientific papers that portrayed punctuated equilibrium as a 
bold, and even revolutionary, alternative theory of macroevolution. 
Indeed, Gould himself referred to it explicitly as “a speciational theory 
of macroevolution.”>*© 

In their second main paper, published in 1977, Gould and Eldredge 
made explicit their intention to position their theory as a “radical”>’ 
challenge to neo-Darwinian gradualism and to replace it with a 
completely different understanding of the mode and mechanism of 
evolutionary change. Sepkoski notes that in this 1977 article “the 
authors were more explicit about the exact nature of the conceptual 


reconfiguration their theory brought to macroevolution.””® In particular, 
he argues that Gould and Eldredge “extended their model to propose a 
new and ‘general philosophy of change’ in the natural world.”*? Gould 
was no less radical in a widely cited 1980 paper in the journal 
Paleobiology in which he offered punctuated equilibrium as “a new and 
general theory” of evolution. There he also famously declared the 
synthetic theory of neo-Darwinism “effectively dead, despite its 
persistence as textbook orthodoxy.’”? 

Only after critics exposed punctuated equilibrium for lacking an 
adequate mechanism did Gould retreat to a more conservative 
formulation of the theory, making its reliance upon the neo-Darwinian 
mechanism explicit. From the early 1980s until his death in 2002, Gould 
made a series of concessions in particular about the inadequacy of 
speciation and species selection as mechanisms for generating complex 
adaptations. Thus, as Sepkoski notes, “Despite the brashness of many of 
his claims on behalf of punctuated equilibrium over the years, one is 
brought time and again back to the reconciliatory, even conservative 
justifications Gould made for his theory,” particularly, he notes, in The 
Structure of Evolutionary Theory, written in the years just before 
Gould’s death.*! 

In the end, Gould’s concessions to neo-Darwinism brought his 
thinking back into conflict with the pattern of sudden appearance in the 
fossil record that the theory of punctuated equilibrium was designed to 
explain. If Gould and Eldredge were right about the abrupt appearance 
of new forms of life in the fossil record, and if the neo-Darwinian 
mechanism needs as much time as evolutionary biologists and 
population geneticists (see Chapters 8 and 12) calculate, then the 
mutation and selection mechanism does not have enough time to 
produce the new traits needed to build the forms of life that first appear 
in the Cambrian period. But punctuated equilibrium, as initially 
formulated to rely mainly on allopatric speciation and species selection, 
fared no better, since neither mechanism gives any explanation for the 
origin of new traits. And so, in the end, punctuated equilibrium 
highlighted rather than resolved a profound dilemma for evolutionary 


theory: neo-Darwinism allegedly has a mechanism capable of producing 
new genetic traits, but it appears to produce them too slowly to account 
for the abrupt appearance of new form in the fossil record; punctuated 
equilibrium attempts to address the pattern in the fossil record, but fails 
to provide a mechanism that can produce new traits whether abruptly or 
otherwise. No wonder, then, that leading Cambrian paleontologists such 
as James Valentine and Douglas Erwin concluded in 1987, that “neither 
of the contending theories of evolutionary change at the species level, 
phyletic gradualism or punctuated equilibrium, seem applicable to 
[explaining] the origin of new body plans.”4? 


SPINNING IN CIRCLES 


In a sudden flash of insight in the humdrum of a Laundromat, Niles 
Eldredge realized that stasis in the fossil record represented evidence 
rather than mere investigative failure. But like laundry spinning around 
in a washing machine, the theory of punctuated equilibrium itself 
became caught in a dreary cycle of contradiction. On the one hand, 
“punk eek” made a bold attempt to describe more accurately, and even 
explain, the decidedly discontinuous pattern of the fossil record. On the 
other, its advocates were forced to concede both the inadequacy of their 
proposed mechanisms and their need to rely upon the neo-Darwinian 
process of mutation and selection in order to account for the origin of 
new genetic traits and anatomical innovations. After Gould appeared to 
jettison both gradualism and a reliance on the neo-Darwinian 
mechanism in order to bring evolutionary theory into conformity with 
the fossil record, he eventually acknowledged that he could not explain 
the origin of the forms of life documented in the fossil record apart from 
that same slow and gradually acting mechanism. Thus, though the 
theory of punctuated equilibrium was initially presented as a solution to 
the mysterious and sudden origin of animal forms, upon closer 
inspection, it failed to offer such a solution. 

Nevertheless, the failure of punctuated equilibrium to provide a 
sufficient mechanism has raised questions about the adequacy of the 


mechanism that Gould ultimately did reaffirm as the explanation for the 
origin of novel biological form. Can the neo-Darwinian mechanism of 
natural selection acting on random mutations build new forms of animal 
life with all their complex adaptations? If so, is it possible that it could 
do so in the brief time allowed by the fossil record? If not, is it 
reasonable to think that it could build new forms of animal life if only 
more time were available? If so, how much time would the Darwinian 
mechanism need to build complex adaptations and new forms of animal 
life? In the next several chapters, I will address these fundamental 
questions at the heart of the Cambrian mystery—questions, in brief, 
about how to build an animal. 


Part Two 


How to Build an Animal 
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The Cambrian Information Explosion 


When I was a college professor, I used to ask my students a question: “If 
you want your computer to acquire a new function or capability, what do 
you have to give it?” Typically, I would hear a smattering of similar 
answers from the class: “code,” “instructions,” “software,” 
“information.” Of course, all these are correct. And thanks to discoveries 
in modern biology, we now know that something similar is true of life: 
to build a new form of life from a simpler preexisting form requires new 
information. 

To this point I’ve examined one main aspect of the mystery 
surrounding the Cambrian explosion: the mystery of the missing 
Precambrian ancestral forms expected on the basis of Darwin’s theory. 
The next group of chapters will examine a second, and perhaps more 
profound, aspect of the Cambrian mystery: that of the cause of the 
Cambrian explosion. By what means or process or mechanism could 
something as complex as a trilobite have arisen? Could natural selection 
have accomplished such a feat? To answer this question we will have to 
look more closely at what it takes to build a new form of animal life. 
And we’ll see that an important part of the answer to that question will 
have something to do with the concept of information. 


THE DARWINIAN ACCOUNT OF THE ORIGIN OF ANIMAL 
FORM 


As Darwin envisioned the process, natural selection can accomplish 
nothing without a steady supply of variation as a source of new 


biological traits, forms, and structures. Only after useful new variations 
arise can natural selection sift them from the chaff of unhelpful 
variations. If, however, the amount of variation available to natural 
selection is limited, then natural selection will encounter limits on how 
much new biological form and structure it can build. 

Even in the late nineteenth century, many leading scientists 
recognized this. For this reason, there has been a long history of 
scientific controversy about just how much novelty natural selection can 
produce and about whether natural selection is a truly creative process. 
In fact, between 1870 and 1920 classical Darwinism entered a period of 
eclipse, because many scientists thought that it could not explain the 
origin and transmission of new heritable variation. ! 

Darwin favored a theory of blended inheritance that seemed to imply 
limitations on the amount of genetic variability.” He thought that when 
parents with different traits combined germ cells during sexual 
reproduction, the resulting offspring would receive not one or the other 
set of their differing traits, but instead a compromise version. For 
example, if a male bird with red feathers on its wings mated with a 
female bird of the same species with white wing feathers, the theory 
implied that the two would likely produce offspring with pink wing 
feathers. As many of Darwin’s contemporaries pointed out, such 
instances of blending inheritance involved strict limitations on the range 
of traits that could possibly arise, depriving natural selection of the 
wide-ranging supply of variation it would need to produce truly 
fundamental changes in the form of animals. The pink-feathered 
offspring might later reproduce with a white- or red-feathered bird of the 
same species, producing a slightly lighter or darker shade of pink 
feathers. Nevertheless, descendants of the original white- and red- 
feathered pair would never produce green, blue, or yellow feathers in 
subsequent generations. If correct, blending inheritance would 
eventually lead to a bland, homogenous, variationless state in a 
population. 

In the 1860s, the Austrian monk Gregor Mendel, widely regarded as 
the founder of modern genetics, showed in his work on garden peas that 


Darwin’s assumptions about blending inheritance were incorrect. The 
results of his studies created, at least initially, more problems for 
Darwinism. Mendel showed that the genetic traits of organisms typically 
have an integrity that resists blending. He showed this by cross- 
pollinating plants with yellow peas and plants with green peas. The 
plants in the subsequent generations produced either yellow or green 
peas, but nothing in between and nothing with an altogether different 
color.° 

He also showed that the plants carried some kind of signal or 
instructions for building different traits even when the trait was not on 
display in a particular plant. He noticed, for example, that when he 
crossed pea plants with green and yellow seeds, the next generation had 
only yellow seeds, almost as if the capacity for generating green peas 
had been lost. But when he cross-pollinated the second-generation 
plants, the ones with only yellow peas, he found that both yellow and 
green peas emerged in the third generation, in a ratio of 3 to 1. From this 
Mendel hypothesized that the second-generation plants continued to 
carry signals, which he called “factors,” and later scientists called genes, 
for generating green peas even when those plants themselves did not 
display that trait. 

The classical Mendelian genetics that replaced Darwin’s blending 
theory of inheritance also suggested limitations on the amount of genetic 
variability available to natural selection. If plant reproduction produced 
either green or yellow peas but never some intermediate form, and if the 
signals for producing the green traits and yellow traits persisted 
unchanged from generation to generation, it was difficult to see how 
sexual reproduction and genetic recombination could produce anything 
more than unique combinations of already existing traits. 

In the decades immediately after Mendel’s work, geneticists came to 
understand genes as discrete units or packets of hereditary information 
that could be independently sorted and shuffled within the chromosome. 
This too suggested that a significant, but still strictly limited, amount of 
genetic variation could arise by genetic recombination during sexual 
reproduction. Thus, Mendelian genetics raised significant questions 


about whether the process of natural selection has access to enough 
variation (which, after Mendel, was conceived as genetic variation) to 
allow it to produce any significant morphological novelty. For a time, 
Darwin’s theory was in retreat. 


DARWINISM MUTATES 


During the 1920s, 1930s, and 1940s, however, developments in genetics 
revived natural selection as the main engine of evolutionary change. 
Experiments performed by Hermann Muller in 1927 showed that X-rays 
could alter the genetic composition of fruit flies, resulting in unusual 
variations. Muller called these X-ray-induced changes “mutations.” 
Other scientists soon reported that they had produced mutations in the 
genes of other organisms, including humans. Whatever genes were made 
of—and biologists then still did not know—these developments 
suggested they could vary more than either Darwin or classical 
Mendelian genetics had assumed. Geneticists at the time also discovered 
that these small-scale changes in genes were potentially heritable.° But if 
variant versions of genes were heritable, then presumably natural 
selection could favor advantageous gene variants and eliminate the 
others. These mutations could then influence the future direction of 
evolution and, in theory at least, provide an unlimited supply of variation 
for natural selection’s workshop. 

The discovery of genetic mutations also suggested a way to reconcile 
Darwinian theory with insights from Mendelian genetics. During the 
1930s and 1940s, a group of evolutionary biologists, including Sewall 
Wright, Ernst Mayr, Theodosius Dobzhansky, J. B. S. Haldane, and 
George Gaylord Simpson, attempted to demonstrate this possibility 
using mathematical models to show that small-scale variations and 
mutations could accumulate over time in whole populations, eventually 
producing large-scale morphological change. These mathematical 
models formed the basis of a subdiscipline of genetics known as 
population genetics. The overall synthesis of Mendelian genetics with 


Darwinian theory came to be called “neo-Darwinism” or simply the 
“New Synthesis.” 

According to this new synthetic theory, the mechanism of natural 
selection acting upon genetic mutations suffices to account for the origin 
of novel biological forms. Small-scale “microevolutionary” changes can 
accumulate to produce large-scale “macroevolutionary” innovations. 
The neo-Darwinists argued that they had revived natural selection by 
discovering a specific mechanism of variation that could generate new 
forms of life from simpler preexisting ones. By the centennial 
celebration of Darwin’s Origin of Species in 1959, it was widely 
assumed that natural selection and random mutations could indeed build 
new forms of life over the course of time with their distinctive body 
plans and novel anatomical structures. At the celebration, Julian Huxley, 
the grandson of T. H. Huxley, summarized this optimism in a grand 
proclamation: 


Future historians will perhaps take this Centennial Week as 
epitomizing an important critical period in the history of this earth 
of ours—the period when the process of evolution, in the person of 
inquiring man, began to truly be conscious of itself. . . . This is one 
of the first public occasions on which it has been frankly faced 
that all aspects of reality are subject to evolution, from atoms and 
stars to fish and flowers, from fish and flowers to human societies 


and values—indeed, that all reality is a single process of evolution 
7 


In a television broadcast leading up to the Centennial celebration, 
Huxley captured the optimistic mood more succinctly: “Darwinism has 
come of age, so to speak. We are no longer having to bother about 
establishing the fact of evolution.”® 


VARIATION AS INFORMATION 


Initially, the elucidation of the structure of DNA by James Watson and 
Francis Crick in 1953 contributed to this euphoria.’ Indeed, it seemed to 
lift the mechanism of genetic variation and mutation out of the mist and 
into the clear light of the emerging science of molecular biology. Watson 
and Crick’s elucidation of the double helix structure of DNA suggested 
that DNA stored genetic information in the form of a four-character 
digital and chemical code (see Fig. 8.1). Later, following the formulation 
of Francis Crick’s famed “sequence hypothesis,” molecular biologists 
confirmed that the chemical subunits along the spine of the DNA 
molecule called nucleotide bases function just like alphabetic characters 
in a written language or digital characters in a machine code. Biologists 
established that the precise arrangement of these nucleotide bases 
conveyed instructions for building proteins.!° (See Fig. 8.2.) Molecular 
biologists also determined that this store of genetic information in DNA 
is transmitted from one generation of cells and organisms to another. In 
short, it was established that DNA stores hereditary information for 
building proteins and thus, presumably, for building higher-order 
anatomical traits and structures as well. 

The elucidation of the double helix seemed to resolve some long- 
standing issues in evolutionary biology. Darwinists had long maintained 
that natural selection produced new forms by separating the proverbial 
wheat from the chaff of genetic variation, but they didn’t know where 
the raw material for all of the competing variations resided. Neither did 
they know how genes stored information for producing the traits 
associated with them. Further, even after geneticists discovered that 
stable genetic traits could be altered by mutations, they remained 
uncertain about what exactly it was that was being “mutated.” 
Consequently, biologists were uncertain about exactly where variations 
and mutations occurred. 





FIGURE. 8.1 

James Watson (/eft) and Francis Crick (right) presenting their model of 
the structure of the DNA molecule in 1953. Courtesy A. Barrington 
Brown/Science Source. 


Watson and Crick’s model suggested an answer to that question: 
genes correspond to long sequences of bases on a strand of DNA. 
Building on that insight, evolutionary biologists proposed that new 
variations arose, first, from the genetic recombination of different 
sections of DNA (different genes) during sexual reproduction and, 
second, from a special kind of variation called mutations that occur from 
random changes in the arrangement of nucleotide bases in DNA. Just as 
a few typographical errors in an English sentence might alter the 
meaning of a few words or even the whole sentence, so too might a 
change in the sequential arrangement of the bases in the genetic “text” in 
DNA produce new proteins or morphological traits. 
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FIGURE 8.2 

The model (or structural formula) of the DNA molecule showing the 
digital or alphabetic character of the nucleotide bases stored along the 
sugar-phosphate backbone of the molecule. 


Watson and Crick’s discovery also raised new questions—in 
particular, questions about the information necessary to build completely 
new forms of life during the course of biological evolution. True, 
mutations play a role in this process, but could they generate enough 
information to produce novel forms of animal life such as those that 
arose in the Cambrian period, an explosion—a vast proliferation—of 
new biological information? 


THE CAMBRIAN INFORMATION EXPLOSION 


Consider choanoflagellates, a group of single-celled eukaryotic 
organisms with a flagellum. What separates such organisms from a 
trilobite or a mollusk or even a lowly sponge? Clearly, all three of these 
higher forms of life are more complex than any one-celled organism. But 
exactly how much more complex? 

James Valentine has noted that one useful way of comparing degrees 
of complexity is to assess the number of cell types in different organisms 
(see Fig. 8.3).'! Though a single-celled eukaryote has many specialized 
internal structures such as a nucleus and various organelles, it still, 
obviously, represents just a single type of cell. Functionally more 
complex animals require more cell types to perform their more diverse 
functions. Arthropods and mollusks, for example, have dozens of 
specific tissues and organs, each of which requires “functionally 
dedicated,” or specialized, cell types. 

These new cell types, in turn, require many new and specialized 
proteins. An epithelial cell lining a gut or intestine, for example, secretes 
a specific digestive enzyme. This enzyme requires structural proteins to 
modify its shape and regulatory enzymes to control the secretion of the 
digestive enzyme itself. Thus, building novel cell types typically 


requires building novel proteins, which requires assembly instructions 
for building protetns—that is, genetic information. Thus, an increase in 
the number of cell types implies an increase in the amount of genetic 
information. 
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FIGURE 8.3 
Biological complexity scale as measured in number of cell types of 
different organisms. 


Applying this insight to ancient life-forms underscores just how 
dramatic the Cambrian explosion was. For over 3 billion years, the 
living world included little more than one-celled organisms such as 
bacteria and algae.!? Then, beginning in the late Ediacaran period (about 
555-570 million years ago), the first complex multicellular organisms 
appeared in the rock strata, including sponges and the peculiar Ediacaran 
biota discussed in Chapter 4.!? This represented a large increase in 
complexity. Studies of modern animals suggest that the sponges that 
appeared in the late Precambrian, for example, probably required about 
ten cell types. !4 

Then 40 million years later, the Cambrian explosion occurred.!> 
Suddenly the oceans swarmed with animals such as trilobites and 
anomalocaridids that probably required fifty or more cell types—an even 
greater jump in complexity. Moreover, as Valentine notes, measuring 


complexity differences by measuring differences in the number of cell 
types probably “greatly underestimate[s] the complexity differentials 
between bodyplans.” 16 

One way to estimate the amount of new genetic information that 
appeared with the Cambrian animals is to measure the size of the 
genomes of modern representatives of the Cambrian groups and 
compare that to the amount of information in simpler forms of life. 
Molecular biologists have estimated that a minimally complex single- 
celled organism would require between 318,000 and 562,000 base pairs 
of DNA to produce the proteins necessary to maintain life.!” More 
complex single cells might require upwards of a million base pairs of 
DNA. Yet to assemble the proteins necessary to sustain a complex 
arthropod such as a trilobite would need orders of magnitude more 
protein-coding instructions. By way of comparison, the genome size of a 
modern arthropod, the fruit fly Drosophila melanogaster, is 
approximately 140 million base pairs.'* Thus, transitions from a single 
cell to colonies of cells to complex animals represent significant—and in 
principle measurable—increases in genetic information. 

During the Cambrian period a veritable carnival of novel biological 
forms arose. But because new biological form requires new cell types, 
proteins, and genetic information, the Cambrian explosion of animal life 
also generated an explosion of genetic information unparalleled in the 
previous history of life.!? (In Chapter 14, we’ll see that building a new 
animal body plan also requires another type of information, not stored in 
genes, called epigenetic information.) 

So can the neo-Darwinian mechanism explain the dramatic increase 
in genetic information that appears in the Cambrian explosion? Before 
addressing that question, it will help to define the concept of information 
and identify the kind of information that DNA contains. 


BIOLOGICAL INFORMATION: SHANNON OR OTHERWISE? 


Scientists typically recognize at least two basic types of information, 
functional (or meaningful) information and so-called Shannon 


information, which is not necessarily meaningful or functional. The 
distinction has arisen in part because of developments in a branch of 
applied mathematics known as information theory. During the late 
1940s, mathematician Claude Shannon, working at the Bell 
Laboratories, developed a mathematical theory of information. Shannon 
equated the amount of information transmitted by a sequence of symbols 
or characters with the amount of uncertainty reduced or eliminated by 
the transmission of that sequence.”” 

Shannon thought that an event or communication that didn’t 
eliminate much uncertainty was also not very informative. Consider an 
illustration. In the 1970s, when I was a teenager, if someone made a 
completely obvious statement, we would say, “Tell me something else I 
didn’t know.” Imagine that one of my classmates on the baseball team 
has just rushed up to breathlessly “inform” me that our team’s star 
pitcher is planning to throw the ball to the catcher in the next game. 
Such a statement would earn the scornful reply, “Tell me something else 
I didn’t know.” 

The obvious statement about the pitcher’s intentions also illustrates 
why Shannon equated the elimination of uncertainty with the 
transmission of information. Since star pitchers who want to go on being 
star pitchers have no choice but to throw the ball across the plate to the 
catcher, the statement of my overwrought friend eliminated no previous 
uncertainty. It was not informative in the least. If, on the other hand, 
after days of speculation on campus about which one of our team’s four 
pitchers the baseball coach would choose to pitch in the championship 
game, my friend had rushed up to me and revealed the identity of the 
starting pitcher, that would be different. In that case, he would have 
eliminated some significant uncertainty on my part with a decidedly 
informative statement. 

Shannon’s theory quantified the intuitive connection between reduced 
uncertainty and information by asserting that the more uncertainty an 
event or communication eliminated, the more information it conveyed. 
Imagine that after revealing to me the identity of the starting pitcher 
before the championship baseball game in the spring, he then also 


revealed to me the identity of the starting quarterback before the 
upcoming football season. Imagine as well that our baseball team had 
four equally competent pitchers and the football team had just two 
equally competent quarterbacks. Given these facts, my friend’s decision 
to inform me of the identity of the starting pitcher eliminated more 
uncertainty than his decision to reveal to me the identity of the starting 
quarterback. 

To assign precise quantitative measures of information, Shannon 
further linked the reduction of uncertainty and information to 
quantitative measures of probability (or improbability). Notice that in 
my illustration, the more informative communication reduced more 
uncertainty and also described a more improbable event. The probability 
of any one of the four pitchers being selected was 1 in 4. The probability 
of one of the quarterbacks being selected, given the same assumption, 
was only 1 in 2. The more improbable event eliminated more 
possibilities and more uncertainty. Thus, it conveyed more information. 

Shannon applied these intuitions to quantify the amount of 
information present in sequences of symbols or characters stored in texts 
or codes or transmitted across communications channels. Thus, in his 
theory the presence of an English letter in a sequence of other such 
letters transmits more information than a single binary digit (a zero or 
one) in a section of computer code. Why? Again, the letter in the English 
alphabet reduces uncertainty among twenty-six possibilities, whereas a 
single binary digit reduces uncertainty among only two. The probability 
of any one character from the English alphabet occurring in a sequence 
of other such letters (disregarding the need for spaces and punctuation) 
is 1 in 26. The probability of either a zero or one arising in a sequence of 
binary characters is 1 in 2. In Shannon’s theory the presence of the more 
improbable character conveys more information. 

Yet even a binary alphabet can convey an unlimited amount of 
information because, in Shannon’s theory, additional information is 
conveyed as improbabilities multiply. Imagine a grab bag of tiles with 
either zero or one etched onto each. Imagine someone producing a series 
of zeros and ones by reaching into the bag and placing them one by one 


on a game board. The probability of choosing a zero on the first pick is 
just 1 in 2. But the probability of choosing two consecutive zeros after 
placing the first back in the grab bag (and shaking the tiles) is 1 chance 
in 2 x 2, or 1 chance in 4. This is because there are four possible 
combinations of digits that could have been chosen—00, 01, 10, or 11. 
Similarly, the probability of producing any three-letter sequence as a 
result of choosing consecutively in this manner is 1 in 2 x 2 x 2, or 1 in 
23 (or 1 in 8). The improbability of any specific sequence of characters 
increases exponentially with the number of characters in the sequence. 
Thus, longer and longer sequences can generate larger and larger 
amounts of information even using a simple binary alphabet. 

Information scientists measure such informational increases through 
a unit they call a bit. A bit represents the minimum amount of 
information that can be conveyed (or uncertainty reduced) by a single 
digit in a two-character alphabet.7! 

Biologists can readily apply Shannon’s information theory to measure 
the amount of Shannon information in a sequence of DNA bases (or the 
sequence of amino acids in a protein) by assessing the probability of the 
sequence occurring and then converting that probability to an 
information measure in bits.22 DNA conveys information, in Shannon’s 
sense, in virtue of its containing long improbable arrangements of four 
chemicals—the four bases that fascinated Watson and Crick—adenine, 
thymine, guanine, and cytosine (A, T, G, and C). As Crick realized in 
formulating his sequence hypothesis, these nucleotide bases function as 
alphabetic or digital characters in a linear array. Since each of the four 
bases has an equal 1 in 4 chance of occurring at each site along the spine 
of the DNA molecule, biologists can calculate the probability, and thus 
the Shannon information, or what is technically known as the 
“information-carrying capacity,’ of any particular sequence n bases 
long. For instance, any particular sequence three bases long has a 
probability of 1 chance in 4 x 4 4, or 1 chance in 64, of occurring— 
which corresponds to 6 bits of Shannon information. (Indeed, each base 
in a DNA sequence conveys 2 bits of information, since | in 4 is equal 
to 1 chance in 2 x 2.) 


Yet the applicability of Shannon information theory to molecular 
biology has, to some degree, obscured a key distinction concerning the 
type of information that DNA possesses. Although Shannon’s theory 
measures the amount of information in a sequence of symbols or 
characters (or chemicals functioning as such), it doesn’t distinguish a 
meaningful or functional sequence from useless gibberish. For example: 


“we hold these truths to be self-evident” 
“ntnyhiznslhtgeqkahedsjnfplkneymsed” 


These two sequences are equally long and equally improbable if we 
imagine them being drawn at random. Thus, they contain the same 
amount of Shannon information. Yet clearly there is an important 
qualitative distinction between them that the Shannon measurement does 
not capture. The first meaningful sequence performs a communication 
function, while the second does not. 

Shannon emphasized that the kind of information his theory 
described needs to be carefully distinguished from our common notions 
of information. As Warren Weaver, one of Shannon’s close 
collaborators, made clear in 1949, “The word ‘information’ in this 
theory is used in a special mathematical sense that must not be confused 
with its ordinary usage.”?> By ordinary usage, Weaver, of course, was 
referring to the idea of meaningful or functional communication. 

Webster’s dictionary defines information as “the communication or 
reception of knowledge or intelligence.” It also defines information as 
“the attribute inherent in, and communicated by, alternative sequences or 
arrangements of something that produce a specific effect.” A sequence 
of characters possessing a large amount of Shannon information may 
convey meaning (as in an English text) or perform a function that 
“produces a specific effect” (as do both English sentences and computer 
codes, for example) or it may not (as would be the case with a 
meaningless pile of letters or a screen of scrambled computer code). In 
any case, Shannon’s purely mathematical theory of information does not 
distinguish the presence of meaningful or functional sequences from 


merely improbable, though meaningless ones. It only provides a 
mathematical measure of the improbability—or information-carrying 
capacity—of a sequence of characters. In a sense, it provides a measure 
of a sequence’s capacity to carry functional or meaningful information. 
It does not, and cannot, determine whether the sequence in question does 
convey meaning or generate a functionally significant effect. 

Strands of DNA contain information-carrying capacity—something 
Shannon’s theory can measure.** But DNA, like natural languages and 
computer codes, also contains functional information.” 

In languages such as English, specifically arranged characters convey 
functional information to conscious agents. In computer or machine 
code, specifically arranged characters (zeros and ones) produce 
functionally significant outcomes within a computational environment 
without a conscious agent receiving the meaning of the code inside the 
machine. In the same way, DNA stores and conveys functional 
information for building proteins or RNA molecules, even if it is not 
received by a conscious agent. As in computer code, the precise 
arrangement of characters (or chemicals functioning as characters) 
allows the sequence to “produce a specific effect.” For this reason, I also 
like to use the term specified information as a synonym for functional 
information, because the function of a sequence of characters depends 
upon the specific arrangement of those characters. 

And DNA contains specified information, not just Shannon 
information or information-carrying capacity. As Crick himself put it in 
1958, “By information I mean the specification of the amino acid 
sequence in protein. . . . Information means here the precise 
determination of sequence, either of bases in the nucleic acid or on 
amino acid residues of the protein.”*° 


THE MESSAGE AS THE MYSTERY 


So if the origin of the Cambrian animals required vast amounts of new 
functional or specified information, what produced this information 
explosion? Since the molecular biological revolution first highlighted 


the primacy of information to the maintenance and function of living 
systems, questions about the origin of information have moved 
decidedly to the forefront of discussions about evolutionary theory. 
What’s more, the realization that specificity of arrangement, rather than 
mere improbability, characterizes the genetic text has raised some 
challenging questions about the adequacy of the neo-Darwinian 
mechanism. Is it plausible to think that natural selection working on 
random mutations in DNA could produce the highly specific 
arrangements of bases necessary to generate the protein building blocks 
of new cell types and novel forms of life? Perhaps nowhere do such 
questions pose more of a challenge to neo-Darwinian theory than in 
discussions of the Cambrian explosion. 
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Combinatorial Inflation 


Murray Eden (see Fig. 9.1), a professor of engineering and computer 
science at MIT, was accustomed to thinking about how to build things. 
But when he began to consider the importance of information to building 
living organisms, he realized something didn’t add up. His critics said 
that he knew just enough biology to be dangerous. In retrospect, they 
were probably right. 

In the early 1960s, just as molecular biologists had confirmed Francis 
Crick’s famed sequence hypothesis, Eden began to think about the 
challenge of building a living organism. Of course, Eden wasn’t 
contemplating building a living organism himself. Rather, he was 
thinking about what it would take for the neo-Darwinian mechanism of 
natural selection acting on random mutations to do the job. He wondered 
whether mutation and selection could generate the needed functional 
information. 

To his way of thinking, specificity was a big part of the problem. 
Obviously, if DNA contained an improbable sequence of nucleotide 
bases in which the arrangement of bases does not matter to the function 
of the molecule, then random mutational changes in the sequence of 
bases would not have a detrimental effect on the function of the 
molecule. But, of course, sequence does affect function. Eden knew that 
in all computer codes or written text in which the specificity of sequence 
determines function, random changes in sequence consistently degrade 
function or meaning. As he explained, “No currently existing formal 
language can tolerate random changes in the symbol sequences which 
express its sentences. Meaning is almost invariably destroyed.”! Thus, 


he suspected that the need for specificity in the arrangement of DNA 
bases made it extremely improbable that random mutations would 
generate new functional genes or proteins as opposed to degrading 
existing ones. 

But how improbable? How difficult would it be for random mutations 
to generate, or stumble upon, the genetically meaningful or functional 
sequences needed to supply natural selection with the raw material—the 
genetic information and variation—it needed to produce new proteins, 
organs, and forms of life? Eden wasn’t the only mathematician or 
scientist asking these questions. But the mathematically based challenge 
to evolutionary theory that he helped to initiate would indeed prove 
dangerous to neo-Darwinian orthodoxy. 


THE WISTAR INSTITUTE CONFERENCE 


During the early 1960s, Eden began discussing the plausibility of the 
neo-Darwinian theory of evolution with several MIT colleagues in math, 
physics, and computer science. As the discussion grew to include 
mathematicians and scientists from other institutions, the idea of a 
conference was born. In 1966, a distinguished group of mathematicians, 
engineers, and scientists convened a conference at the Wistar Institute in 
Philadelphia called “Mathematical Challenges to the Neo-Darwinian 
Interpretation of Evolution.” Prominent among the attendees were 
Marcel-Paul Schitzenberger, a mathematician and physician at the 
University of Paris; Stanislaw Ulam, the codesigner of the hydrogen 
bomb; and Eden himself. The conference also included a number of 
prominent biologists, including Ernst Mayr, an architect of modern neo- 
Darwinism, and Richard Lewontin, at the time a professor of genetics 
and evolutionary biology at the University of Chicago. 





FIGURE 9.1 
Murray Eden. Courtesy MIT Museum. 


Sir Peter Medawar, a Nobel laureate and the director of the North 
London Medical Research Council’s laboratories, chaired the meeting. 
In his opening remarks, he said, “The immediate cause of this 
conference is a pretty widespread sense of dissatisfaction about what has 
come to be thought of as the accepted evolutionary theory in the 
English-speaking world, the so-called neo-Darwinian theory.”” 

For many, doubts about the creative power of the mutation and 
selection mechanism stemmed from the elucidation of the nature of 
genetic information by molecular biologists in the late 1950s and early 
1960s. 

The discovery that the genetic information in DNA is stored as a 
linear array of precisely sequenced nucleotide bases at first helped to 
clarify the nature of many mutational processes. Just as a sequence of 
letters in an English text might be altered either by changing individual 
letters one by one or by combining and recombining whole sections of 
text, so too might the genetic text be altered either one base at a time or 
by combining and recombining different sections of genes in various 
ways at random. Indeed, modern genetics has established various 
mechanisms of mutational change—not only “point mutations,” or 
changes in individual bases, but also duplications, insertions, inversions, 
recombinations, and deletions of whole sections of the genetic text. 

Although fully aware of this range of mutational options at nature’s 
disposal, Eden argued at Wistar that such random changes to written 
texts or sections of digital code would inevitably degrade the function of 
information-bearing sequences, particularly when allowed to 
accumulate.* For example, the simple phrase “One if by land and two if 
by sea” will be significantly degraded by just a handful of random 
changes such as those in bold: “Ine if bg lend and two ik bT Nea.” At 
the conference the French mathematician Marcel Schiitzenberger agreed 
with Eden’s concerns about the effect of random alterations. He noted 
that if someone makes even a few random changes in the arrangement of 


the digital characters in a computer program, “we find that we have no 
chance (i.e., less than 1/10!°°°) even to see what the modified program 
would compute: it just jams.” Eden argued that much the same problem 
applied to DNA—that insofar as specific arrangements of bases in DNA 
function like digital code, random changes to these arrangements would 
likely efface their function, while attempts to generate completely new 
sections of genetic text by random means were likely doomed to 
failure.’ 

The explanation for this inevitable diminution in function is found in 
a branch of mathematics called combinatorics. Combinatorics studies the 
number of ways a group of things can be combined or arranged. At one 
level, the subject is fairly intuitive. If a thief slips round the corner of a 
dormitory after hours looking for a bike to steal, he will scan the bike 
rack for an easy target. If he spots a basic bicycle-style lock with only 
three dials of ten numbers each, and on the rack beside it one with five 
dials of ten numbers each, the thief wouldn’t need a degree in 
mathematics to realize which one he should attempt to open. He knows 
that he would need to search fewer total possibilities with the three-dial 
lock. 

A straightforward calculation supports his intuition. The simpler lock 
has only 10 x 10 x 10, or 1000, possible combinations of digits—or 
what mathematicians refer to as “combinatorial” possibilities. The five- 
dial lock has 10 x 10 x 10 x 10 x 10, or 100,000, combinatorial 
possibilities. With a lot of patience, the thief might elect to 
systematically work his way through the different combinations of digits 
on the simpler lock, knowing that at some point he will stumble across 
the correct combination. He shouldn’t even bother with the five-dial 
lock, since making his way through all of the possible combinations on it 
would take 100 times as long. The five-dial lock simply has too many 
possibilities for the thief to have a reasonable chance of opening it by 
trial and error in the time available to him. 

Several of the Wistar scientists noted that the mutation and selection 
mechanism faces a similar problem. Neo-Darwinism envisions new 
genetic information arising from random mutations in the DNA. If at 


any time from birth to reproduction the right mutation or combination of 
mutations accumulate in the DNA of cells involved in reproduction 
(whether sexual or asexual), then information for building a new protein 
or proteins will pass on to the next generation. When that new protein 
happens to confer a survival advantage on an organism, the genetic 
change responsible for the new protein will tend to be passed on to 
subsequent generations. As favorable mutations accumulate, the features 
of a population will gradually change over time. 

Clearly, natural selection plays a crucial role in this process. 
Favorable mutations are passed on; unfavorable mutations are weeded 
out. Nevertheless, the process can only select variations in the genetic 
text that mutations have first produced. For this reason, evolutionary 
biologists typically recognize that mutation, not natural selection, 
provides the source of variation and innovation in the evolutionary 
process. As evolutionary biologists Jack King and Thomas Jukes put it 
in 1969, “Natural selection is the editor, rather than the composer, of the 
genetic message.”® 

And that was the problem, as the Wistar skeptics saw it: random 
mutations must do the work of composing new genetic information, yet 
the sheer number of possible nucleotide base or amino-acid 
combinations (i.e., the size of the combinatorial “space’’) associated with 
a single gene or protein of even modest length rendered the probability 
of random assembly prohibitively small. For every sequence of amino 
acids that generates a functional protein, there are a myriad of other 
combinations that don’t. As the length of the required protein grows, the 
number of possible amino-acid combinations mushrooms exponentially. 
As this happens, the probability of ever stumbling by random mutation 
onto a functional sequence rapidly diminishes. 

Consider another illustration. The two letters X and Y can be 
combined in four different two-letter combinations (XX, XY, YX and 
YY). They can be combined in eight different ways for three-letter 
combinations (XXX, XXY, XYY, XYX, YXX, YYX, YXY, YYY), 
sixteen ways for four-letter combinations, and so on. The number of 
possible combinations grows exponentially—27, 2°, 24, and so on—as 


the number of letters in the sequence grows. Mathematician David 
Berlinski calls this the problem of “combinatorial inflation,” because the 
number of possible combinations “inflates” dramatically as the number 
of characters in a sequence grows (see Fig. 9.2). 

The combinations of bases in DNA are subject to combinatorial 
inflation of just this sort. The information-bearing sequences in DNA 
consist of specific arrangements of the four nucleotide bases. 
Consequently, there are four possible bases that could occur at each site 
along the DNA backbone and 4 x 4, or 4, or 16 possible two-base 
sequences (AA AT AG AC TA TG TC TT CG CT CC CA GA GG GC 
GT). Similarly, there are 4 x 4 x 4, or 4°, or 64 possible three-base 
sequences. (Pll refrain from listing them all.) That is, increasing the 
number of bases in a sequence from 1 to 2 to 3 increases the number of 
possibilities from 4 to 16 to 64. As the sequence length continues to 
grow, the number of combinatorial possibilities corresponding to 
sequences of increasing length inflates exponentially. For example, there 
are 4! or 10°, possible ways of arranging one hundred bases in a row. 
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FIGURE 9.2 

The problem of combinatorial inflation as illustrated by bike locks of 
varying sizes. As the number of dials on the bike locks increases, the 
number of possible combinations rises exponentially. 


The amino-acid chains are also subject to such inflation. A chain of 
two amino acids could display 20°, or 20 x 20, or 400 possible 
combinations, since each of the twenty protein-forming amino acids 
could combine with any one of that same group of twenty in the second 
position of a short peptide chain. With a three-amino-acid sequence, 
were looking at 20°, or 8,000, possible sequences. With four amino 
acids, the number of combinations rises exponentially to 20+, or 
160,000, total combinations, and so on. 

Now, the number of combinatorial possibilities corresponding to a 
chain with four amino acids only marginally outstrips the combinatorial 
possibilities associated with the five-dial lock in my first illustration 
(160,000 vs. 100,000). It turns out, however, that many necessary, 
functional proteins in cells require far, far more than just four amino 
acids linked in sequence, and necessary genes require far, far more than 
just a few bases. Most genes—sections of DNA that code for a specific 
protein—consist of at least one thousand nucleotide bases. That 
corresponds to 4!000—an unimaginably large number—possible base 
sequences of that length. 

Moreover, it takes three bases in a group called a codon to designate 
one of the twenty protein-forming amino acids in a growing chain during 
protein synthesis. If an average gene has about 1000 bases, then an 
average protein would have over 300 amino acids, each of which are 
called “residues” by protein chemists. And indeed proteins typically 
require hundreds of amino acids in order to perform their functions. This 
means that an average-length protein represents just one possible 
sequence among an astronomically large number—20°"", or over 10390 
—of possible amino-acid sequences of that length. Putting these 
numbers in perspective, there are only 10° atoms in our Milky Way 
galaxy and 108° elementary particles in the known universe. 


That is what bothered Eden and other mathematically inclined 
scientists at Wistar. They understood the immensity of the combinatorial 
spaces associated with even single genes or proteins of average length. 
They realized that if the mutations themselves were truly random—that 
is, if they were neither directed by an intelligence nor influenced by the 
functional needs of the organism (as neo-Darwinism stipulates)—then 
the probability of the mutation and selection mechanism ever producing 
a new gene or protein could well be vanishingly small. Why? The 
mutations would have to generate, or “search” by trial and error, an 
enormous number of possibilities—far more than were realistic in the 
time available to the evolutionary process. 

Eden pointed out in his Wistar presentation that the combinatorial 
space corresponding to an average-length protein (which he assumed to 
be about 250 amino acids long) is 207°°—or about 103*°—possible 
amino-acid arrangements. Did the mutation and selection mechanism 
have enough time—since the beginning of the universe itself—to 
generate even a small fraction of the total number of possible amino-acid 
sequences corresponding to a single functional protein of that length? 
For Eden, the answer was clearly no. 

For this reason, Eden thought mutations had virtually no chance of 
producing new genetic information. He likened the probability of 
producing the human genome by relying on random mutations to that of 
generating a library of a thousand volumes by making random changes 
or additions to a single phrase in accord with the following instructions: 
“Begin with a meaningful phrase, retype it with a few mistakes, make it 
longer by adding letters [at random], and rearrange subsequences in the 
string of letters; then examine the result to see if the new phrase is 
meaningful. Repeat this process until the library is complete.”’ Would 
such an exercise have a realistic chance of succeeding, even granting it 
billions of years? Eden thought not. 

In addition, Schiitzenberger emphasized that randomly cutting and 
pasting larger blocks of text, as evolutionary biologists often envision, 
would not make any appreciable difference to the efficacy of a random 
search of sequence space. Imagine a computer “mutating” at random the 


text of the play Hamlet either by individual-letter substitutions or by 
duplicating, swapping, inverting, or recombining whole sections of 
Shakespeare’s text. Would such a computer simulation have a realistic 
chance of generating a completely different and equally informative text 
such as, say, The Blind Watchmaker by Richard Dawkins, even granting 
multiple millions of undirected mutational iterations? 

Schiitzenberger didn’t think so. He noted that making random 
changes “at the typographic level” in a computer program inevitably 
degrades its function whether those changes are made “by letters or by 
blocks, the size of the unit really does not matter.”*® Thus, he thought that 
a process of randomly shuffling blocks of text in any “typographic 
typology” would inevitably degrade meaning in much the same way that 
a series of individual-letter substitutions will. 

Schiitzenberger insisted that the evolutionary process faced similar 
limitations. To him, it seemed extremely unlikely that random mutations 
of whatever sort would produce significant amounts of novel and 
functionally specified information within the time available to the 
evolutionary process. 

Subsequent to the confirmation of Crick’s sequence hypothesis, all 
present at Wistar understood that the entities that confer functional 
advantages on organisms—new genes and their corresponding protein 
products—constitute long linear arrays of precisely sequenced subunits, 
nucleotide bases in the case of genes and amino acids in the case of 
proteins. Yet, according to neo-Darwinian theory, these complex and 
highly specified entities must first arise and provide some advantage 
before natural selection can act to preserve them. Given the number of 
bases present in genes, and amino acids present in functional proteins, a 
large number of changes in the arrangement of these molecular subunits 
would typically have to occur before a new functional and selectable 
protein could arise. For even the smallest unit of functional innovation— 
a novel protein—to arise, many improbable rearrangements of 
nucleotide bases would need to occur before natural selection had 
anything new and advantageous to select. 


Eden and others questioned whether mutations provided an adequate 
explanation for the origin of the genetic information necessary to build 
new proteins, let alone whole new forms of life. As physicist Stanislaw 
Ulam explained at the conference, the evolutionary process “seems to 
require many thousands, perhaps millions, of successive mutations to 
produce even the easiest complexities we see in life now. It appears, 
naively at least, that no matter how large the probability of a single 
mutation is, should it be even as great as one-half, you would get this 
probability raised to a millionth power, which is so very close to zero 
that the chances of such a chain seem to be practically non-existent.” 


LOOKING FOR A LOOPHOLE 


In his presentation at the conference, Eden himself acknowledged a 
possible way of resolving this dilemma. He suggested that it was at least 
possible that “functionally useful proteins are very common in this 
[ combinatorial] space so that almost any polypeptide one is likely to find 
[as the result of mutation and selection] has a useful function.”!° Many 
neo-Darwinian biologists subsequently came to favor this possible 
solution. The solution was this: even though the size of the 
combinatorial space that mutations needed to search was enormous, the 
ratio of functional to nonfunctional base or amino-acid sequences in 
their relevant combinatorial spaces might turn out to be much higher 
than Eden and others had assumed. If that ratio turned out to be high 
enough, then the mutation and selection mechanism would frequently 
stumble onto novel genes and proteins and could easily leapfrog from 
one functional protein island to the next, with natural selection 
discarding the nonfunctional outcomes and seizing upon the rare (but not 
too rare) functional sequences. 

As an electrical engineer who was used to working with computer 
code, Eden was intuitively disinclined to embrace this possibility. He 
noted that all codes and language systems can convey information 
precisely because they have rules of grammar and syntax. These rules 
ensure that not just any arrangement of characters will convey functional 


information. For this reason, functional sequences in working 
communications systems are typically surrounded in the larger 
combinatorial space by a multitude of nonfunctional sequences— 
sequences that don’t obey the rules. 

In known codes and language systems, functional sequences do 
indeed typically represent tiny islands of meaning amid a great sea of 
gibberish. Geneticist Michael Denton has shown that in English 
meaningful words and sentences are extremely rare among the set of 
possible combinations of letters of a given length, and they become 
proportionally rarer as sequence length grows.!! The ratio of meaningful 
12-letter words to 12-letter sequences is 1/10!*; the ratio of meaningful 
100-letter sentences to possible 100-letter strings has been estimated as 
1/10!°°. Denton used these figures in 1985 to explain why random letter 
substitutions inevitably degrade meaning in English text after only a few 
changes and why the same thing might be true of the genetic text. 

Given the alphabetic or “typographic” character of genetic 
information stored in DNA, Murray Eden and others at Wistar suspected 
that the same kind of problems would affect random mutational changes 
in DNA. It seemed logical that functional genes and proteins were also 
surrounded in their relevant combinatorial spaces by vast numbers of 
nonfunctional sequences—and, further, that the ratio of functional to 
nonfunctional sequences would also be exceedingly small. 

Yet in 1966 none of the scientists on either side of the debates at 
Wistar knew how rare or common functional gene and amino-acid 
sequences are among the corresponding space of total possibilities. Do 
they occur with a frequency of 1 in 10, 1 in a million, or 1 in a million 
billion trillion? At the time, these questions could not be answered. 

Most evolutionary biologists remained optimistic that the answer to 
this question would vindicate the neo-Darwinian model.'!? And some 
developments supported their confidence. During the late 1960s, 
molecular biologists learned that most of the functional roles performed 
by proteins are performed not just by one precise kind of protein, but by 
a wide variety, each with its own amino-acid sequence. This is unlike a 
bike lock, which has only one functional combination. Indeed, molecular 


biologists learned that though some amino acids at certain sites are 
absolutely essential for any particular protein to work, most sites tolerate 
amino-acid substitutions without loss of protein function. For many 
biologists, this suggested that mutation and selection had a reasonable 
chance of generating functional sequences of nucleotide bases or amino 
acids after all—that the ratio of functional to nonfunctional sequences 
was much higher than skeptics had anticipated. 

How much higher? How much variability is allowed in the amino- 
acid sequences in proteins? Are there enough functional proteins within 
a relevant combinatorial space of possibilities to render a random 
mutational search for new proteins plausible? 

When Denton compared linguistic and genetic text to explain the 
potential severity of the combinatorial inflation facing the neo- 
Darwinian mechanism, he noted that biologists still didn’t know enough 
“to calculate with any degree of certainty the actual rarity of functional 
proteins.” He concluded, however, that since future experiments surely 
would continue to deepen molecular biology’s fund of knowledge, “it 
may be that before long quite rigorous estimates may be possible.” !3 


IN SEARCH OF THE RATIO 


Denton’s prediction of imminent progress proved correct. During the 
late 1980s and early 1990s, Robert Sauer, a molecular biologist at MIT, 
performed a series of experiments that first attempted to measure the 
rarity of proteins within amino-acid sequence space. 

Sauer’s work exploited, for the first time, new technology that 
allowed for the systematic manipulation of gene sequences. Before the 
late 1970s, scientists typically used radiation and chemicals to produce 
mutant forms of DNA. Though these techniques sometimes paid off with 
dramatic results, such as mutant fruit flies with legs growing out of their 
heads (the famed Antennapedia mutation), they did not allow scientists 
to dictate or target any specific change to a sequence of bases in DNA. 
The treatments used simply replicated the conditions under which 
mutations occur naturally. 


During the late 1970s and early 1980s, however, molecular biologists 
developed technologies for making customized synthetic DNA 
molecules. Robert Sauer used these techniques to make site-directed 
changes to DNA sequences of specific genes of known function and then 
to insert those variants into bacterial cells. He could then evaluate the 
effect of various targeted alterations to a DNA sequence on the function 
of their protein products within a bacterial cell culture. 

Sauer’s technique allowed him to begin to evaluate how many of the 
variant sequences, as a percentage of the total, still produced a functional 
form of the relevant protein (see Fig. 9.3). His initial results confirmed 
that proteins could indeed tolerate a variety of amino-acid substitutions 
at many of the sites in the protein chain. Yet his experiments also 
suggested that functional proteins might be incredibly rare among the 
space of all possible amino-acid sequences. Based on one set of 
mutagenesis experiments, Sauer and his colleagues estimated the ratio of 
functional to nonfunctional amino-acid sequences at about 1 to 10° for 
a short protein of 92 amino acids in length. !4 

This result was in rough agreement with an earlier estimate by 
information theorist Hubert Yockey.'!° Yockey did not perform 
experiments to derive his estimate of the rarity of proteins in 
combinatorial sequence space. Instead, he used already published data to 
compare variants of the similar cytochrome c proteins (proteins involved 
in the biochemical pathways that generate energy in cells) in different 
species. He did this to see how much variability existed at each amino- 
acid site in molecules performing the same function with the same basic 
structure. Using this data about the allowable variability at each site, he 
estimated the probability of finding one of the allowable sequences 
among the total number of sequences corresponding to a cytochrome c 
protein 100 residues in length. He determined the ratio of functional to 
nonfunctional sequences to be about 1 to 10° for amino-acid chains of 
this length.'° So, although Sauer’s experimentally derived results were 
numerically different from Yockey’s, both approaches gave extremely 
low ratios suggesting that functional proteins are indeed rare in sequence 


space, even if proteins do admit significant variability in the specific 
amino acids present at various positions. 
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FIGURE 9.3 
Figure 9.3a (top) depicts the problem of combinatorial inflation as it 
applies to proteins. As the number of amino acids necessary to produce a 
protein or protein fold grows, the corresponding number of possible 
amino-acid combinations grows exponentially. Figure 9.3b (bottom) 
poses graphically the question of the rarity of proteins in that vast 
amino-acid sequence space. 


Taken at face value, Sauer’s experiments appeared to yield 
contradictory conclusions. On the one hand, his results showed that 
many arrangements of amino acids could produce the same protein 
structure and function—that numerous amino-acid sequences populated 
amino-acid sequence space. On the other hand, the ratio of functional 
sequences to the total number of possible sequences corresponding to a 
sequence of roughly 100 amino acids appeared to be incredibly low, just 
1 to 10%. 

Nevertheless, it’s not hard to see how both of Sauer’s seemingly 
contradictory conclusions could be true. Recall the locks confronting my 
hypothetical bike thief. Commercially manufactured bike locks typically 
have only one combination of digits that will allow them to be opened. 
The combination that will open a typical bike lock specifies one digit on 
each dial. No variability at any dial is allowed. 

Now imagine a new kind of lock with three crucial differences from 
an ordinary lock. First, with this new alternative lock, there are four 
positions on every dial that may—in combination with other positions 
on other dials—open the lock. My bike thief would like this feature of 
this kind of lock, since it seems to allow more wiggle room at each dial. 
But he doesn’t like the two other features of this lock. For one, each dial 
displays one of 20 letters rather than one of 10 numeric digits. Second, 
instead of 5 dials, there are 100 dials. On the upside, because 4 of the 20 
letters on each of the 100 dials might work, there are 4!°°, or a whopping 
1060, correct combinations that will open the lock. That’s an 
astronomically large number of correct combinations. But, on the 
downside, there are 20 possible settings at each of 100 dials, which 
computes to 20!, or 10130, possible combinations, a number that totally 
dwarfs the number of correct combinations of dial settings. 

The bike thief would be happy to learn that every dial has “only” four 
possible correct positions on each dial, but in a trial-and-error process, 
he still has only 1 chance in 5 (4 out of 20) of landing on a possibly 
functional dial position for any dial; and this 1-in-5 chance must be 
negotiated 100 times. In other words, the 1-in-5 chance must be 
multiplied by the 100 dials on the monster lock in order to arrive at the 


probability of the thief stumbling upon a functional combination on a 
given try. The odds are 1 chance in 5!°° or—if we want to convert that to 
base 10—roughly 1 chance in 10’. The odds are that slender because 
the functional combinations—numerous as they are—are dwarfed by the 
number of total combinations. 

In the same way, Sauer established that though many different 
combinations of amino acids will produce roughly the same protein 
structure and function, the sequences capable of producing these 
functional outcomes are still extremely rare. He showed that for every 
functional 92-amino-acid sequence there are roughly another 10° 
nonfunctional sequences of the same length. To put that ratio in 
perspective, the probability of attaining a correct sequence by random 
search would roughly equal the probability of a blind spaceman finding 
a single marked atom by chance among all the atoms in the Milky Way 
galaxy—on its face clearly not a likely outcome.!” 


UNCERTAIN SITUATION 


Nevertheless, during the 1990s in the immediate wake of the publication 
of Sauer’s results, the implications of his work for evolutionary theory 
were not entirely clear. Even in the scientific paper in which Sauer 
reported his work, the abstract summarizing his results emphasized the 
tolerance to amino-acid substitution that proteins allow. Consequently, 
scientists on both sides of the discussion about Darwinian evolution 
seized on different aspects of Sauer’s findings either to support or 
challenge the plausibility of the neo-Darwinian account of the origin of 
genes and proteins. 

Scientists sympathetic to neo-Darwinism emphasized the tolerance of 
proteins to amino-acid substitution; critics of the theory emphasized the 
rarity of proteins in sequence space. One scientist, Ken Dill, a 
biophysicist at University of California, San Francisco, cited Sauer’s 
work to suggest that nearly any amino acid would work at any site in a 
protein chain, provided the amino acids in question exhibited the correct 


hydrophobicity (water-repelling) or hydrophilicity (water-attracting) 
properties. !8 

Yet, at least one scientist, Lehigh University biochemist Michael 
Behe, cited Sauer’s quantitative estimate of the rarity of proteins as a 
decisive refutation of the creative power of the mutation and selection 
mechanism altogether.'? So by the mid-1990s, though Sauer and his 
group had initiated a program of experimental research that addressed 
the key question that Murray Eden raised at Wistar, that question had 
still not been completely settled. Did the mutation and natural selection 
mechanism have a realistic chance of finding the new genes and proteins 
necessary to build, for example, a new Cambrian animal? Answering 
that would await an even more systematic and comprehensive 
experimental regime. 
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The Origin of Genes and Proteins 


As a Ph.D. student in chemical engineering at the California Institute of 
Technology in the late 1980s, Douglas Axe (see Fig. 10.1) became 
interested in evolutionary theory after several fellow graduate students 
read the then-bestselling book by Richard Dawkins, The Blind 
Watchmaker. Axe’s compatriots were quickly converted to zealous 
advocates of Dawkins’s arguments and urged him to read the book for 
himself. Axe was impressed by the clarity of Dawkins’s writing and 
illustrations, but he found his case for the creative power of natural 
selection and random mutations unpersuasive. Whether in the analogies 
he drew to animal breeding or the computer simulations he used to 
demonstrate the supposed ability of mutation and selection to generate 
new genetic information, Dawkins repeatedly smuggled in the very thing 
he insisted the concept of natural selection expressly precluded: the 
guiding hand of an intelligent agent. 

He found Dawkins’s computer simulation particularly interesting. In 
The Blind Watchmaker, Dawkins described how he had programmed a 
computer to generate the Shakespearean phrase: “Me thinks it is like a 
weasel.”! Dawkins did this in order to simulate how random mutations 
and natural selection could generate new functional information. 

Dawkins programmed the computer first to generate many separate 
strings (sequences) of English letters. He then programmed it to compare 
each string to the Shakespearean target phrase and select only the string 
that most closely resembled that target.2 The program then generated 
variant versions of that newly selected string and compared those 
sequences to the target, selecting, again, only the one that most closely 


resembled the desired target. This eventually generated—after many 
iterations—a string that matched the target perfectly. 

Axe recognized immediately the role that Dawkins’s own intelligence 
had played. Not only did Dawkins provide the program with the 
information that he wanted it to generate (“Me thinks it is like a 
weasel”), he imbued the computer with a kind of foresight by directing it 
to compare the variant sequences of letters with the desired target. Axe 
realized that Dawkins’s program did not simulate natural selection, 
which by definition is neither guided toward nor given information 
about a desired outcome generations in the future. 

Axe began to wonder if there was some other way to assess the 
creative power of the mutation and natural selection mechanism, not 
with clever analogies or simplistic computer simulations, but with 
experimental and mathematical rigor. 

Axe realized that Dawkins was right about one thing: the importance 
of genetic information. Like his fellow engineer Murray Eden, Axe’s 
tendency to view biology as an engineer led him to ask whether 
selection and mutation could actually build new organisms. Axe’s own 
research explored the connection between process control (a field of 
study in engineering) and genetic regulation, a sophisticated version of 
automated process control at work on a molecular scale inside living 
cells. Since cells use proteins to perform various feats of regulation, Axe 
was acutely aware that building new organisms necessarily involved 
building new proteins, which would in turn require new genetic 
information. 





FIGURE 10.1 
Douglas Axe. Courtesy Brittnay Landoe. 


But could mutation and selection generate the precise arrays of 
nucleotide bases necessary to build fundamentally new protein 
structures? Axe’s interest in this question eventually led him to the 
scientific papers of Robert Sauer and the proceedings of a seemingly 
obscure 1966 Philadelphia conference called “Mathematical Challenges 
to the Neo-Darwinian Interpretation of Evolution.” 


UNRESOLVED ISSUES 


As Axe read the papers that Sauer’s research group had produced, he 
realized their importance as a first step toward answering the questions 
Murray Eden had raised at Wistar. If Sauer’s quantitative measures of 
rarity held up, then Axe thought it obvious that mutation and selection 
could not adequately search a space that large. If, on the other hand, 
subsequent mutagenesis experiments overturned Sauer’s work and 
showed that protein function was largely indifferent to changes in 
amino-acid sequence, then the number of functional sequences might be 
large enough that mutation and selection would have a good chance of 
finding new functional genes and proteins in a reasonable amount of 
time. 

After completing his Ph.D., Axe made inquiries about doing 
postdoctoral research in a top research lab where he could address these 
unanswered questions. He was soon invited by Alan Fersht, a professor 
at the University of Cambridge and director of the Centre for Protein 
Engineering, part of the world-famous Medical Research Council 
(MRC) Centre at Cambridge, to join his research group. 

The decision to accept Fersht’s offer was an easy one. The star- 
studded history of the adjoining Laboratory for Molecular Biology 
(LMB), arguably the birthplace of molecular biology, included such 
luminaries as James Watson, Francis Crick, Max Perutz, John Kendrew, 
Sidney Brenner, and Fred Sanger. Starting in the chemistry department 
and then moving to the MRC Centre, Axe hoped to apply his research 


training to resolve the uncertainty that surrounded the interpretation of 
Sauer’s results. Specifically, he wanted to eliminate what he saw as two 
sources of error in Sauer’s method in order to get a more definitive 
estimation of the frequency of functional sequences in sequence space. 

Axe thought, first, that Sauer’s team might have underestimated the 
rarity of functional proteins. In their experiments, Sauer’s group tested 
the tolerance of proteins to amino-acid substitution by changing amino 
acids at one or a few consecutive sites without making any other 
changes to other sites at the same time, much like a typist introducing an 
isolated typographic error in an otherwise accurately transcribed text. 
Not surprisingly, Sauer and his colleagues found that many sites along a 
protein chain could tolerate these isolated amino-acid substitutions, just 
as the reader of a text with only a few typos can often make out its 
meaning. Sauer’s team seemed to be assuming that a similar tolerance 
would have emerged if they had changed many sites simultaneously. 

Axe thought that this assumption ignored the importance of the larger 
context provided by the mostly unaltered protein. A single typographic 
error typically will not totally destroy the meaning of a section of 
English text, because of the surrounding context provided by the other 
words, as well as the correct letters in the altered word. That does not 
mean, however, that specificity of sequence doesn’t matter. Instead, the 
meaning of a sentence with a typographic error can be discerned only 
because the rest of the letters are specifically arranged into meaningful 
words and phrases that provide a context for determining the meaning of 
the incorrectly spelled word. For just this reason, however, the meaning 
of a sentence is rapidly degraded if errors are allowed to accumulate at 
multiple sites. 

Axe wondered if much the same could be true of genes and proteins. 
He wondered whether multiple, as opposed to single, position changes 
would quickly degrade function and whether a tolerance for substitutions 
at individual sites was itself context dependent—whether the tolerance 
for substitution at one site might depend upon having highly specific 
sequences at other sites. Thus, without questioning Sauer’s experimental 
findings, Axe thought Sauer’s result lent itself to misinterpretation. To 


many molecular biologists, it suggested that proteins could readily 
accommodate many simultaneous changes to their amino-acid sequences 
at many positions and remain functional. 

As it turns out, Sauer recognized the potential for misinterpretation of 
his results. As he explained in the very paper in which he developed his 
quantitative estimate of rarity, “this calculation overestimates the 
number of functional sequences, since changes at individual positions 
are less likely to be independent of one another as more positions are 
allowed to vary.”° 

Another assumption in Sauer’s approach had potentially the opposite 
effect—exaggerating the rarity of functional proteins. Axe thought the 
test that Sauer and his colleagues used to decide whether their mutant 
proteins were functional required a higher level of function than natural 
selection might require. Sauer and his team judged proteins with less 
than about 5—10 percent of the function seen in the natural protein to be 
nonfunctional. Yet Axe knew that even damaged enzymes with less than 
5 percent of normal activity could add significantly more benefit than no 
enzymatic activity at all. Thus, from a neo-Darwinian point of view, the 
emergence of even such handicapped proteins might confer a selectable 
advantage on an organism. Axe thought that by rejecting as 
nonfunctional such mutated sequences, Sauer’s team probably had 
introduced another estimation error. These competing errors made it 
hard to know if the estimate made by Sauer’s team was too high or too 
low or whether perhaps they might neatly cancel each other out. To 
eliminate both sources of possible error, Axe carefully designed a new 
series of experiments. 


THE IMPORTANCE OF FOLDS 


Axe had a key insight that animated the development of his experimental 
program. He wanted to focus on the problem of the origin of new protein 
folds and the genetic information necessary to produce them as a critical 
test of the neo-Darwinian mechanism. Proteins comprise at least three 
distinct levels of structure:+ primary, secondary, and tertiary, the latter 


corresponding to a protein fold. The specific sequence of amino acids in 
a protein or polypeptide chain make up its primary structure. The 
recurring structural motifs such as alpha helices and beta strands that 
arise from specific sequences of amino acids constitute its secondary 
structure. The larger folds or “domains” that form from these secondary 
structures are called tertiary structures (see Fig. 10.2). 

Axe knew that as new life-forms arose during the history of life—in 
events such as the Cambrian explosion—many new proteins must also 
have arisen. New animals typically have new organs and cell types, and 
new cell types often call for new proteins to service them. In some cases 
new proteins, while functionally new, would perform their different 
functions with essentially the same fold or tertiary structure as earlier 
proteins. But more often, proteins capable of performing new functions 
require new folds to perform these functions. That means that explosions 
of new life-forms must have involved bursts of new protein folds as 
well. 


AMINO ACIDS 





SEVERAL BETA STRANDS 
FORMING A BETA SHEET 


ALPHA HELIX 





A PROTEIN FOLD 


FIGURE 10.2 

Different levels of protein structure. The first panel at the top shows the 
primary structure of a protein: a sequence of amino acids forming a 
polypeptide chain. The second panel depicts, in two different ways, two 
secondary structures: an alpha helix (left), and beta strands forming a 
beta sheet (right). The third panel at the bottom shows, in two different 
ways, a tertiary structure—that is, a protein fold. 


The late geneticist and evolutionary biologist Susumu Ohno noted 
that Cambrian animals required complex new proteins such as, for 
example, lysyl oxidase in order to support their stout body structures. 
When these molecules originated in Cambrian animals, they also likely 
represented a completely novel folded structure unlike anything present 
in Precambrian forms of life such as sponges or one-celled organisms. 
Thus, Axe was convinced that explaining the kind of innovation that 
occurred during the Cambrian explosion and many other events in the 
history of life required a mechanism that could produce, at least, 
distinctly new protein folds. 

He had another reason for thinking that the ability to produce novel 
protein folds provided a critical test for the creative power of the 
mutation and selection mechanism. As an engineer, Axe understood that 
building a new animal required innovation in form and structure. As a 
protein scientist, he understood that new protein folds could be viewed 
as the smallest unit of structural innovation in the history of life. 

It follows that new protein folds represent the smallest unit of 
structural innovation that natural selection can select. Of course, natural 
selection can operate on smaller units of change—individual amino-acid 
changes that result in slight functional advantages or fitness gains, but 
not new folds, for example. But what if the functional or fitness gains 
that natural selection preserves and passes on never generate structural 
innovations? What if, instead, it only preserves slight differences in the 
sequence or function of proteins that confer an advantage without 
altering structure? Then, clearly, fundamental changes in the form of an 
organism will not occur. Building fundamentally new forms of life 


requires structural innovation. And new protein folds represent the 
smallest selectable unit of such innovation. Therefore, mutations must 
generate new protein folds for natural selection to have an opportunity to 
preserve and accumulate structural innovations. Thus, Axe realized that 
the ability to produce new protein folds represents a sine qua non of 
macroevolutionary innovation. 

Could random mutations generate such novel protein folds? Axe 
realized that answering this question depended upon measuring the rarity 
of functional genes and proteins in sequence space and determining 
whether random genetic mutations would have enough opportunities to 
search the relevant sequence spaces within evolutionary time. 


AXE’S INITIAL RESULTS 


Axe read the paper by Sauer and his colleague John Reidhaar-Olson that 
estimated the proportion of functional protein sequences to be extremely 
low (1 in 10°). He noticed that the authors chose not to emphasize this 
measure of rarity, however, but instead the variety of amino-acid 
substitutions that the protein under study could tolerate. 

In their paper, Reidhaar-Olson and Sauer also repeated a then popular 
idea that the amino acids buried in the interior of a folded protein 
(forming what is known as the hydrophobic core) are most important for 
specifying the structure, while the arrangement of the exterior amino 
acids did not matter nearly as much.’ They thought that the amino acids 
buried within folded proteins typically need only to be hydrophobic 
(water repelling), whereas the amino acids on the exterior, for the most 
part, need to be hydrophylic (water attracting). In fact, some protein 
scientists thought that these simple restrictions might be the whole story 
—that a functional protein fold might require nothing more than an 
appropriate arrangement of hydrophobic and hydrophylic amino acids in 
a given sequence. 

At Fersht’s lab in Cambridge, Axe conducted an experimental test of 
this idea, and surprised himself with the first result. In a paper he 
coauthored in the Proceedings of the National Academy of Sciences in 


1996, he reported his findings. When he replaced the entire thirteen- 
residue hydrophobic core of a small enzyme with random combinations 
of other hydrophobic amino acids, a high fraction of the randomized 
proteins (about one-fifth) still performed their original function. This 
suggested that proteins were, perhaps, less susceptible to functional loss 
as a result of sequence changes than Axe had thought. 

Next he focused on the exterior of proteins, randomizing portions of 
two different proteins’ exteriors in much the same way that he had 
randomly changed the interior of one of them. This time his approach 
failed to produce any functional variants at all. Realizing that this 
seemed to contradict what Sauer and others had supposed, Axe decided 
to make only much more restrictive changes in the next trial. He 
replaced each exterior amino-acid residue only with its most similar 
amino-acid alternative. Nevertheless, both of the proteins that he studied 
still lost all function by the time he had replaced one-fifth of their 
exterior residues. Thus he concluded that the exterior parts of the 
proteins were much more susceptible to functional loss as a result of 
amino-acid changes than had been widely assumed. 

In all this work, Axe designed his experiments to remedy the two 
sources of estimation error inherent in Sauer’s method. First, by 
studying amino-acid changes in combination rather than in isolation, he 
determined that the surrounding context typically did influence whether 
a particular amino-acid change at a particular site caused functional loss. 
In other words, he discovered that the neglect of the influence of the 
surrounding context had the effect of exaggerating the tolerance to 
amino-acid changes at particular sites, as he (and Sauer) had suspected. 

Second, the proteins that Axe chose for his study made it possible for 
function to be detected at much lower levels than was possible in Sauer’s 
studies. For one protein Axe studied, the more sensitive test of function 
did indeed allow a greater proportion of single mutants to retain some 
function, with about 95 percent of the mutant proteins achieving the 
designation “active.” This suggested that Sauer’s less sensitive screen 
did contribute to another source of estimation error, this time in the 
opposite direction. Nevertheless, Axe’s more sensitive screen also 


enabled him to establish that even though single mutations allow many 
proteins to retain some function, they still diminish or damage the 
function of the protein—often enough to ensure that they will be 
eliminated by the purifying effect of natural selection. Further, because 
of the extreme sensitivity of his test for function, Axe learned that any 
single mutation that failed his test was single-handedly destroying 
function. He determined that fully 5 percent of such changes did destroy 
protein function. 

Overall, therefore, he showed that despite some allowable variability, 
proteins (and the genes that produce them) are indeed highly specified 
relative to their biological functions, especially in their crucial exterior 
portions. Axe showed that whereas proteins will admit some variation at 
most sites if the rest of the protein is left unchanged, multiple as opposed 
to single amino-acid substitutions consistently result in rapid loss of 
protein function. This was the case even when these changes occur at 
sites that allow variation when altered in isolation. His new experiment 
also roughly confirmed Sauer’s earlier quantitative assessment of the 
rarity of functional proteins, despite the estimation errors inherent in 
Sauer’s method. Why? Because it appeared that Sauer’s two estimation 
errors—ignoring context and using an insufficiently sensitive screen for 
function—did, in fact, roughly cancel each other out. 

Despite these advances in understanding, Axe had not yet determined 
whether the greatly restricted picture of tolerance that his work had 
exposed would cause problems for the evolution of new protein folds. In 
order to answer that question he would need to obtain a more precise 
quantitative estimate of the rarity of proteins in sequence space. 

Having developed a method that eliminated the main sources of 
estimation error in earlier mutagenesis experiments, Axe was now in a 
position to answer that question with unprecedented rigor. Once he did, 
he could determine whether random genetic changes would have enough 
opportunities—even on the scale of evolutionary time—to search the 
relevant sequence spaces for functional genes and proteins. 


DON’T LEAVE THE FOLD 


Of course, Axe understood that neo-Darwinists do not envision a 
completely random journey through nucleotide or amino-acid sequence 
space. They see natural selection acting to preserve useful mutational 
variations and to eliminate deleterious ones. Richard Dawkins, for 
example, likens an organism to a high mountain peak.’ He compares 
climbing the sheer precipice up the front side of the mountain to 
building a new organism purely by chance—random mutations alone. 
He acknowledges that this approach up “Mount Improbable” will not 
succeed. Nevertheless, he asserts that there is a gradual slope up the 
back side of the mountain that can be climbed in small incremental 
steps. In his analogy, the back side up “Mount Improbable” corresponds 
to the process of natural selection acting on many small random changes 
in the genetic text. What chance alone cannot do, natural selection acting 
on random mutations can accomplish through the cumulative effect of 
many slight successive steps. 

Yet Axe’s experimental results presented a problem not only for 
scenarios involving random mutations acting alone, but also for 
scenarios envisioning selection and random mutation acting in concert. 
Further, his mutagenesis experiments cast doubt on each of the two 
scenarios by which evolutionary biologists might envision new protein 
folds (and the information necessary to produce them) arising as the 
result of the mutation and selection mechanism. 

In theory, new genes capable of producing a new protein fold might 
arise either from (a) preexisting genes or from (b) nonfunctional sections 
of the genome. That is—to adapt Dawkins’s visual analogy—mutation 
and natural selection could conceivably generate a new functional gene 
starting from either (a) another mountain peak (a different preexisting 
functional gene) or (b) from the valley floor (a nonfunctional section of 
the genome). Yet Axe’s experimental results would show that the action 
of natural selection would not help solve the search problem confronting 
the mutation mechanism in either of these two cases. To see why, we 
need to understand a bit more about each of these two possible neo- 
Darwinian scenarios as well as Axe’s subsequent experimental findings. 


FROM PEAK TO PEAK 


In the first case, evolutionary biologists might envision mutation and 
selection gradually altering a preexisting gene (and its protein product) 
to produce another functional gene (and a different protein product). 
This scenario involves moving metaphorically from one functional peak 
to another without dipping into a valley (a zone of diminished fitness or 
nonfunction). 

Most evolutionary biologists reject this first scenario. They do so 
because they recognize that mutations in preexisting genes will typically 
degrade functional genetic information. They know, too, that when 
genes lose function, natural selection will eventually eliminate the 
organisms that possess these genes. Genes that contribute to the healthy 
function of an organism, that have been mutated in such a way as to 
diminish that function, will be subject to what evolutionary biologists 
call “purifying selection.” That is, natural selection will typically 
eliminate organisms possessing mutation-induced gene variants that 
diminish function or fitness. (When natural selection preserves genetic 
changes that enhance function or fitness, evolutionary biologists call that 
“positive selection.”’) 

Axe’s mutagenesis experiments confirmed these reasons for doubting 
the first of the possible neo-Darwinian scenarios, at least as an 
explanation for the origin of new protein folds. In work that he published 
in 2000, he showed that it is, indeed, exceedingly difficult to make 
extensive changes to functional amino-acid sequences without 
destabilizing a protein fold. Even best-case changes involving the most 
chemically similar amino acids in the exterior of proteins tended to 
destabilize protein folds. 

In these experiments, Axe mutated a gene that produced a protein 
exhibiting a single fold and function. He found that as he altered this 
protein, multiple position changes in the exterior of the protein molecule 
quickly effaced or destroyed its function.’ Yet to turn one protein with a 
distinctive folded structure into another with a completely novel 
structure and function requires specified changes at many, many sites— 


far more than Axe altered in his experiments.'? The number of changes 
necessary to produce a new protein fold typically exceeds the number of 
changes that will result in functional loss. Given this, the probability of 
the evolutionary process successfully traversing a functional landscape 
from one functional peak to another—all the while escaping functional 
loss each step along the way—is extremely small, with the probability 
diminishing exponentially with each additional requisite change.!! 
Indeed, by showing that functional proteins with distinct folds are far 
more sensitive to functional loss than protein scientists had previously 
assumed, Axe’s experiments confirmed what most evolutionary 
biologists suspected—namely, that protein-to-protein (or functional 
gene—to—functional gene) evolution is a no-go where the mutation and 
selection mechanism must produce a new protein fold (see Fig. 10.3). 

Axe had a more fundamental reason for considering the first 
evolutionary scenario implausible. Based on the physical principles of 
protein function, the vast majority of protein functions simply cannot be 
performed by unfolded proteins. In other words, stability of protein 
structure is a precondition of protein function. Destabilized protein folds 
not only lose the three-dimensional structures they need to perform 
functional tasks, they are also vulnerable to attack by other proteins 
called proteases that devour unfolded proteins or polypeptides in the 
cell.!? 

As one structure is degraded as the result of multiple sequence 
changes, it will necessarily lose structural stability, resulting in a 
catastrophic loss of function. Yet any diminution in the function of a 
protein will also diminish fitness in a way that will subject the protein 
(and its corresponding gene) to the purifying action of natural 
selection.!? Indeed, according to the equations of population genetics, 
the standard mathematical expression of neo-Darwinian theory, even 
slight losses in fitness will subject the disadvantageous traits that 
produce such losses to purifying selection, thus eliminating them. That 
means that even many protein sequences that retain a significant, though 
diminished, portion of their original function nevertheless will not 
survive the winnowing effects of the neo-Darwinian mechanism. Thus, 


the gradual transformation of one functional fold into another was a 
complete nonstarter. 
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FIGURE 10.3 

This figure illustrates why many evolutionary biologists reject the idea 
that genes and proteins under selection pressure will evolve into new 
functional genes and proteins. Since genes, like English sentences, 
contain sequence-specific functional information, multiple changes in 
the genetic text will inevitably degrade function (or fitness) long before 
a new functional sequence will arise—just as random changes in a 
meaningful English sentence will typically destroy meaning long before 
such changes produce a significantly different meaningful sentence. 





Research performed at the European Molecular Biology Laboratory 
by molecular biologist Francisco Blanco has since confirmed this 
conclusion. Using site-directed mutagenesis, Blanco’s team found that 
the sequence space between two naturally occurring protein domains is 
not continuously populated by folded or functional proteins. By 
sampling intermediate sequences between two sequences that do adopt 
different folds, Blanco found that the intermediate sequences “lack a 
well-defined three-dimensional structure.” Thus, he concluded that “the 
appearance of a completely new fold from an existing one is unlikely to 
occur by evolution through a route of folded intermediate sequences.” !4 

Thus, both experimental results and the physics of protein folding 
implied that random searches for novel proteins starting from preexisting 
protein-coding genes would result in functional loss long before a 
protein with a novel fold would emerge, as most evolutionary biologists 


already suspected. Although the first of the two possible evolutionary 
scenarios has the advantage of starting on a mountain peak—with a 
functional gene and protein—it also has a lethal disadvantage: randomly 
mutating the gene will soon destabilize a protein fold and/or generate 
nonfunctional intermediate sequences and structures long before a new 
gene (capable of generating a new fold) would arise. For this reason, this 
scenario involves not so much a climb up Mount Improbable, but a step 
out over Valley Impassable. 


SCALING MOUNT IMPROBABLE 


For all these reasons, like most evolutionary biologists, Axe thought the 
second neo-Darwinian scenario—in which new genes and proteins 
emerge from nonfunctional or neutral regions of the genome—provides 
a much more plausible means of producing the information necessary to 
construct novel protein folds. It was to this scenario that Axe turned his 
experimental energies. 

In this scenario, neo-Darwinists envision new genetic information 
arising from sections of the genetic text that can vary freely without 
consequence to the organism. According to this scenario, noncoding 
sections of the genome or duplicated sections of coding regions undergo 
a protracted period of “neutral evolution”! in which alterations in 
nucleotide sequences have no discernible effect on the fitness of the 
organism. Functional genes and proteins gradually rise from a 
nonfunctional valley floor to a functional mountain peak—generating a 
new gene. Natural selection plays a role, but not until a new functional 
gene has arisen. 

Evolutionary biologists typically picture this process beginning with 
a gene duplication event. Although several different mechanisms can 
generate gene duplicates in DNA,!° the most common mechanism 
occurs during the crossing-over step of meiosis (a kind of cell division 
that produces sex cells, or gametes, in sexually reproducing organisms). 
During meiosis, homologous chromosomes swap segments of DNA. In a 
normal crossing-over event, corresponding chromosomal segments of 


equal size are swapped between the two homologous chromosomes, 
ensuring that both chromosomes experience no net gain or loss of genes. 
Sometimes, however, chromosomes swap genetic material of unequal 
length. When this happens, one chromosome (the one that gets the 
smaller piece) ends up losing some DNA, while the chromosome that 
receives the larger segment of genetic material ends up with a new 
stretch of chromosomal DNA—one that may include a gene or genes it 
already had. This results in duplicate copies of a gene on one 
chromosome. 

When this occurs, one of the two genes may begin to vary—to 
experience mutations—without adversely affecting the function of the 
organism, while the other performs the original function. In the jargon of 
evolutionary biology, mutational changes in gene duplicates are 
“selectively neutral’—they initially provide no advantage or 
disadvantage to an organism or population. These gene-duplication 
events allow nature room to experiment safely. Unhelpful but harmless 
genetic novelties can be passed on to future generations, where 
additional mutations one day may render the evolving genetic material 
useful. Eventually, as mutational changes accumulate, a new gene 
sequence may arise in a new organism that can code for a novel protein 
fold and function. At that point, natural selection can favor the new gene 
and its protein product, preserving and passing it along to future 
generations—or so the story goes. 

This scenario—which many evolutionary biologists now refer to as 
the “classical model” of gene evolution—has the advantage of allowing 
portions of the genome to vary freely through many generations, giving 
mutations many opportunities to “search” the space of possible base 
sequences without being punished for drifting into valleys of lost or 
diminished function. 

But this scenario faces an overriding problem: the extreme rarity of 
sequences capable of forming stable folds and performing biological 
functions. Since natural selection does nothing to help generate new 
folded, functional sequences, but rather can only preserve such 
sequences once they have arisen, random mutations alone must search 


for the exceedingly rare folded and functional sequences within the vast 
sea of combinatorial possibilities. 

And that is the big story associated with Axe’s experiments. His 
research showed that folded, functional sequences of amino acids are 
indeed exceedingly rare within sequence space. After his initial round of 
experiments, Axe performed another series of site-directed mutagenesis 
experiments on a 150-amino-acid protein-folding domain within a p- 
lactamase enzyme and published the results in the Journal of Molecular 
Biology.'’ Recall that a folding domain is a portion of a larger protein 
that exhibits a distinctive fold. Since amino-acid chains must first fold 
into stable three-dimensional structures, Axe performed experiments that 
enabled him to estimate the frequency of sequences that will produce 
stable folds—any stable fold—before he estimated the frequency of 
sequences performing a specific (f-lactamase) function. His improved 
experimental method produced a precise quantitative result. He 
estimated (a) the number of 150-amino-acid-long sequences capable of 
folding into stable “function-ready” folded structures compared to (b) 
the whole set of possible amino-acid sequences of that length (recall Fig. 
9.3). Based on his site-directed mutagenesis experiments, he determined 
that ratio to be a vanishingly small 1 in 1074. In other words, for 
sequences 150 amino acids long, only 1 in 1074 sequences will be 
capable of folding into a stable protein. 

For a sequence to achieve a protein fold is only a first step, however. 
A protein must be folded to be functional, but a folded protein is not 
necessarily a functional protein. And although sequences capable of 
forming stable protein folds are necessary to any significant evolutionary 
innovation, natural selection cannot select for the presence of a fold 
unless it also performs a function that confers a specific functional 
advantage on an organism. Thus, Axe also estimated (a) the number of 
proteins of modest length (150 residues) that perform a specified 
function via any folded structure compared to (b) the whole set of 
possible amino-acid sequences of that size. Based on his experiments 
and data about the number of stable folded proteins that exist, Axe 
estimated that ratio to be about 1 to 107”. A telling conclusion follows 


from this experimental data: The probability of any given mutational 
trial generating (or “finding”) a specific functional protein among all the 
possible 150 residue amino-acid sequences is 1 chance in 10’’—that is, 
one chance in one hundred thousand, trillion, trillion, trillion, trillion, 
trillion, trillion. 

That is obviously an incredibly small probability, but is it small 
enough to justify rejecting the classical model of gene evolution? Or is it 
plausible to think that random mutations in the nonfunctional part of the 
genome could overcome these long odds to generate the genetic 
information necessary to produce a novel protein fold with a specific 
selectable function? 


HOW MANY TRIALS? 


When statisticians or scientists assess whether a chance hypothesis 
provides a plausible explanation for the occurrence of an event, they do 
not just evaluate the probability of that particular event occurring once; 
they evaluate the probability of the event occurring given the number of 
opportunities it has to occur. 

For example, if our hypothetical bike thief from the previous chapter 
had enough time to try more than half (more than 500 of the 1000) total 
combinations of a three-dial bike lock, then the probability that he will 
stumble upon the right combination will exceed the probability that he 
will fail. In that case, it will be more likely than not that he will succeed 
in opening the lock by chance. In that case, the chance hypothesis—the 
hypothesis that he will succeed in opening the lock by chance—is more 
likely to be true than false. On the other hand, if just after he started 
trying to crack the lock he heard a security guard coming around the 
corner and only had time to explore a small fraction of the total number 
of possible combinations—far fewer than half—then it will be much 
more likely than not that he will fail to open the lock by chance. 
Consequently, anyone who knew his situation could conclude that the 
chance hypothesis is, in that case, much more likely to turn out false 
than true. 


When statisticians or scientists assess the probability of an event 
occurring by chance, they often assess what is called a conditional 
probability. In deciding the plausibility of a chance hypothesis, they 
assess the probability of the event given or “conditioned on” what else 
we know, especially what else is known about the number of 
opportunities the event has to occur. And they refer to the number of 
opportunities an event has to occur as “the probabilistic resources.” !® 

If the conditional probability of the chance hypothesis, given the 
number of opportunities it has to occur, is less than 1⁄2, then it is more 
likely than not that the event will not happen by chance. It will be 
viewed as implausible—more likely to be false than true. Conversely, if 
the conditional probability of the chance hypothesis, given the number 
of opportunities it has to occur, is more than '4, then it is more likely 
than not that the event in question will occur by chance. It will be 
deemed plausible—more likely to be true than false. And, of course, the 
smaller the conditional probability associated with a hypothesis, the 
more implausible the hypothesis—the more likely the chance hypothesis 
is to be false than true. 

How then should we assess the chance hypothesis for the origin of 
biological information—in particular, the hypothesis that random 
mutations generated the information necessary to produce a novel 
protein fold with a selectable function? What is the conditional 
probability that such a folded protein could arise as the result of random 
mutations in duplicated nonfunctional sections of a genome? Axe 
realized that in order to answer that question he needed a way to 
estimate the number of opportunities that random mutations had for 
producing a new protein fold with a selectable function during the whole 
history of life on earth. 


THE BIOLOGICAL UNIVERSAL PROBABILITY BOUND 


Here, evolutionary theory itself provides the answer. Axe was interested 
in the number of times that mutations might have produced new 
sequences of bases in DNA that were capable of producing a new 


sequence of amino acids—one of the 10%” possible sequences in the 
relevant sequence space. Yet not every such sequence of bases that 
mutations might generate constituted a relevant trial. In theory, 
mutations may alter a gene many times during the life cycle of an 
organism. Nevertheless, natural selection can only act on the new 
sequence of bases that is actually passed on to offspring. It might seem 
that it would be difficult to quantify the number of mutational trials in 
each generation. But even if we think of mutations repeatedly shuffling 
and reshuffling the arrangement of bases during the life cycle of an 
organism, only those mutations in the genes (or DNA) in the 
reproductive cells of parent organisms can have any effect on the next 
generation. Since Axe wanted to know how many novel sequences 
capable of generating a selectable function might have arisen in the 
history of life, he only needed to concern himself with those sequences 
that could be transmitted during reproduction. 

This meant that if he could estimate the total number of organisms 
that had lived during the history of life on earth and the number of new 
genes that mutations might produce and pass on to the next generation, 
he could establish an upper bound on the number of trials relevant to the 
evolutionary process. 

Axe knew that the huge population sizes of prokaryotes like bacteria 
dwarf the population sizes of all other organisms combined. Thus, 
estimates for the size of the bacterial population—plus a smidge for 
everything else—would approximate the size of the number of 
organisms living at any given time. Based on the average length of time 
of a bacterial generation and the time since the first appearance of 
bacterial life on earth (3.8 billion years ago), scientists have estimated 
that a total of about 107° organisms have lived on earth since life first 
appeared.!? Axe made the assumption that each new organism received 
one new sequence of bases (one potential gene) capable of generating 
one of the possible amino-acid sequences in sequence space per 
generation. 

This was an extremely generous assumption. Since mutations have to 
be quite rare for life to survive, most bacterial cells inherit an exact copy 


of their parent’s DNA. Furthermore, the ones that differ from their 
parents are likely to carry a mutation that has already occurred many 
times in other cells. For these reasons, the actual number of new 
sequences sampled in the history of life is much lower than the total 
number of bacterial cells that have existed. Nevertheless, Axe assumed 
that one new gene per organism has been transmitted to the next 
generation. Thus, he used 10% gene sequences as a liberal estimate of 
the total number of gene sequences (evolutionary trials) that have been 
generated to search sequence space in the history of life. 

Even so, 10% represents only a tiny fraction—1 ten trillion, trillion, 
trillionth—of 107’. Thus, the conditional probability of generating a 
gene sequence capable of producing a novel protein fold and function is 
still only 1 in 10°’. This means that if every organism from the dawn of 
time had generated, by random mutation, one new base sequence in the 
sequence space of interest, that would amount to only one 10 trillion, 
trillion, trillionth of the sequences in that space—the space that needs to 
be searched.And, since conditional probability of a new gene arising in 
the manner envisioned by the classical model turns out to be almost 
unimaginably less than 2, the classical model turns out to be vastly 
more likely to be false than true. Thus, Axe concluded that a reasonable 
person should reject it. The probabilistic resources available to the 
classical model of gene evolution are simply far too small to tame 1 
chance in 1077 (see Fig. 10.4). 
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The top panel in this diagram represents the results of Axe’s mutagenesis 
experiments showing the extreme rarity of functional proteins in 
sequence space. Based on his experiments Axe estimated that there are 
107” possible sequences corresponding to a specific functional sequence 
150 amino acids long. The second panel shows that functional amino- 
acid sequences are extremely rare even in relation to the total number of 
opportunities the evolutionary process would have had to generate novel 
sequences (on the assumption that each organism that has ever lived 
during the history of life produced one such sequence per generation). 


To appreciate why this model fails, consider the following 
illustration.After the 1975 Steven Spielberg film Jaws became a big hit, 
one small-town motel advertised “Shark-Free Pool.” Proponents of the 
second evolutionary scenario (gene duplication, followed by neutral 
evolution), envision an evolutionary pool where there are no 
consequences for mutational missteps—by analogy, a pool with no 
predators. But to extend the illustration, picture a predator-free pool the 
size of our galaxy. Now picture a blindfolded man dropped into the 
middle of it. He must swim to the far side, to the one spot on the edge of 
the pool where a ladder would give him a way out. He’s safe from 
predators, but it will do him no good. He needs direction, some way of 


gauging his progress, and an immense amount of time. But he has none 
of these and so he will arrive at the ladder in neither a hundred years, nor 
a hundred billion. Similarly, in the classical model of gene evolution, 
random mutations must thrash about aimlessly in immense 
combinatorial space, a space that could not be explored by this means in 
the entire history of life on earth, let alone in the few million years of the 
Cambrian explosion. 


TO BUILD AN ANIMAL 


Yet Axe’s calculations only hint at the full problem for neo-Darwinian 
theory. By bending over backwards not to overstate the improbability of 
generating a new protein fold and by focusing narrowly on that one 
aspect of the challenge confronting evolutionary theory, his figures 
vastly understate the improbability of building a Cambrian animal. 
There are several reasons for this. 

First, the Cambrian explosion as dated by fossil evidence took far less 
time than has elapsed since the origin of life on earth until the present 
(about 3.8 billion years). Less time available for a given evolutionary 
transition means fewer generations of new organisms and fewer 
opportunities to generate new genes by which to search relevant 
sequence space. This makes it even harder to generate a new protein fold 
by chance in the relevant time period. 

Second, bacteria are by far the most common type of organisms 
included in Axe’s estimate of the total number of organisms that have 
lived on earth. Yet no one thinks that Cambrian animals evolved directly 
from bacteria. Nor does anyone think that the putative multicellular 
ancestors of the Cambrian forms would have been anywhere near as 
abundant as the bacterial populations that Axe used as the main basis of 
his estimate. A more realistic estimate for the number of possible animal 
ancestors would necessarily result in a much lower estimate for the 
number of gene sequences available for searching sequence space 
(corresponding to a single protein of modest length in a single Cambrian 
animal). Recall that, based on Axe’s estimates, the probability of 


generating just one gene (for a new functional protein fold) from all the 
bacteria (and other organisms) that have ever lived on earth is just 1 in 
10 trillion, trillion, trillion. Consequently, the mechanism for searching 
sequence space that Axe determined to be extremely implausible must 
be judged far more implausible as a mechanism for producing the 
Cambrian information explosion, since there were far fewer 
multicellular organisms present in the Precambrian period than there 
have been total organisms present in the entire history of life. 

Third, building new animal forms requires generating far more than 
just one protein of modest length. New Cambrian animals would have 
required proteins much longer than 150 amino acids to perform 
necessary, specialized functions.*! For example, as previously noted, 
many of these Cambrian animals needed the complex protein lysyl 
oxidase to support their stout body structures. In addition to a novel 
protein fold, these molecules (in living organisms) comprise over 400 
precisely sequenced (nonrepeating) amino acids. Reasonable 
extrapolation from mutagenesis experiments done on shorter protein 
molecules suggests that the improbability of randomly producing 
functionally sequenced proteins of this length would be extremely 
unlikely to occur given the probabilistic resources (and duration) of the 
entire universe.7~ 

The mutation and selection mechanism faces a related obstacle. The 
Cambrian animals exhibit structures that would have required many new 
types of cells, each requiring many novel proteins to perform their 
specialized functions. But new cell types require not just one or two new 
proteins, but coordinated systems of proteins to perform their distinctive 
cellular functions. The unit of selection in such cases ascends to the 
system as a whole. Natural selection selects for functional advantage, 
but no advantage accrues from a new cell type until a system of 
servicing proteins is in place. But that means random mutations must, 
again, do the work of information generation without the help of natural 
selection—and, now, not simply for one protein, but for many proteins 
arising together. Yet the odds of this occurring by chance alone are, of 
course, far smaller than the odds of the chance origin of a single new 


gene or protein—so small as to render the chance origin of the 
information needed to build a new cell type fantastically improbable 
(and implausible) given even the most optimistic estimates for the length 
of the Cambrian explosion. 

Richard Dawkins has noted that scientific theories can rely on only so 
much “luck” before they cease to be credible.” But the second scenario, 
involving gene duplication and neutral evolution, by its own logic, 
precludes natural selection from playing a role in generating genetic 
information until after the fact. Thus, it relies entirely on “too much 
luck.” The sensitivity of proteins to functional loss, the rarity of proteins 
within combinatorial sequence space, the need for long proteins to build 
new cell types and animals, the need for whole new systems of proteins 
to service new cell types, and the brevity of the Cambrian explosion 
relative to rates of mutation—all conspire to underscore the immense 
implausibility of any scenario for the origin of Cambrian genetic 
information that relies upon random variation alone, unassisted by 
natural selection. 

Yet the classical model of gene evolution—which relies on neutral 
evolution—trequires novel genes and proteins to arise, precisely, by 
random mutation alone. Adaptive advantage accrues after the generation 
of new functional genes and proteins. Natural selection cannot play a 
role until new functional information-bearing molecules have 
independently arisen. Thus, to return to Dawkins’s imagery, evolutionary 
theorists envisioned the need to scale the steep face of a precipice of 
which there is effectively no gradually sloping back side, since the 
smallest increment of structural innovation in the history of life—a new 
protein fold—itself presents a formidable Mount Improbable. 

By the way, Axe’s later experiments establishing the extreme rarity of 
protein folds in sequence space also show why random changes to 
existing genes inevitably efface or destroy function before they generate 
fundamentally new folds or functions (scenario one). If only one out of 
every 107” of the alternate sequences are functional, an evolving gene 
will inevitably wander down an evolutionary dead-end long before it can 
ever become a gene capable of producing a new protein fold. The 


extreme rarity of protein folds also entails their isolation from each other 
in Sequence space. 


A CATCH-22 


Douglas Axe’s results highlight an acute dilemma for neo-Darwinism, a 
“catch-22.” On the one hand, if natural selection plays no role in 
generating new genes, as the idea of neutral evolution implies, then 
mutations alone must climb a Mount Improbable in a single leap—a 
situation that, given Axe’s results and Dawkins’s own logic, is 
probabilistically untenable. On the other hand, any model for the origin 
of genetic information that envisions a significant role for natural 
selection, by assuming a preexisting gene or protein under selective 
pressure, encounters other equally intractable difficulties. The evolving 
genes and proteins will range over a series of disadvantageous or 
nonfunctional intermediates that natural selection will not favor or 
preserve, but will, instead, eliminate. At that point, selection-driven 
evolution will cease, locking existing genes and proteins into place. 
Thus, whether one envisions the evolutionary process beginning with 
a preexisting functional gene or a duplicated non-coding region of the 
genome, the results of mutagenesis experiments present a precise 
quantitative challenge to the efficacy of the neo-Darwinian mechanism. 
Indeed, our growing knowledge about the rarity and isolation of proteins 
and functional genes in sequence space implies that neither neo- 
Darwinian scenario for producing new genes is at all plausible. Thus, 
neo-Darwinism does not explain the Cambrian information explosion. 
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Assume a Gene 


When I first heard that Douglas Axe had succeeded in making a rigorous 
estimate of the rarity of proteins in sequence space, I wondered what 
neo-Darwinists would say in response. Given the experimental rigor and 
mathematical precision of the work he reported in the Journal of 
Molecular Biology in 2004, and the long odds against mutation and 
selection ever finding a novel gene or functional protein, what could 
they say? That the probability of a successful search for new genes and 
proteins was higher than Axe’s experiments suggested? That his 
methods or calculations were flawed? That no one else had gotten 
similar results? Since Axe’s work confirmed other analyses and 
experiments, and since his paper had passed through the careful scrutiny 
of peer review, none of those responses seemed plausible. Yet defenders 
of the adequacy of the neo-Darwinian mechanism were far from 
admitting defeat, as I would soon find out. 

The same year, I published a peer-reviewed scientific article about 
the Cambrian explosion and the problem of the origin of the biological 
information needed to explain it.! In the paper, I cited Axe’s results and 
explained why the rarity of functional proteins in sequence space posed 
such a severe challenge to the adequacy of the neo-Darwinian 
mechanism. The article appeared in a biology journal, Proceedings of 
the Biological Society of Washington, published out of the Smithsonian 
Institution by scientists working for the Smithsonian’s National Museum 
of Natural History (NMNH). Because the article also argued that the 
theory of intelligent design could help explain the origin of biological 


information (see Chapter 18), its publication created a firestorm of 
controversy. 

Museum scientists and evolutionary biologists from around the 
country were furious with the journal and its editor, Richard Sternberg, 
for allowing the article to be peer-reviewed and then published. 
Recriminations followed. Museum officials took away Sternberg’s keys, 
his office, and his access to scientific samples. He was transferred from a 
friendly to a hostile supervisor. A congressional subcommittee staff later 
investigated and found that museum officials initiated an intentional 
disinformation campaign against Sternberg in an attempt to get him to 
resign. His detractors circulated false rumors: “Sternberg has no degrees 
in biology” (actually he has two Ph.D.’s, one in evolutionary biology 
and one in systems biology); “He is a priest, not a scientist” (Sternberg 
is not a priest, but a research scientist); “He is a Republican operative 
working for the Bush campaign” (he was far too busy doing scientific 
research to be involved in political campaigns, Republican or 
otherwise); “He’s taken money to publish the article” (not true); and so 
on. Eventually, despite the demonstrable falsehood of the charges, he 
was demoted.7 

Major news stories about the controversy appeared in Science, 
Nature, The Scientist, and the Chronicle of Higher Education.» Then 
articles appeared in the mainstream press, including the Washington Post 
and the Wall Street Journal. A major story aired on National Public 
Radio.” Sternberg himself even appeared on The O’Reilly Factor. 

Despite the intense furor, there was no formal scientific response to 
my article: neither the Proceedings nor any other scientific journal 
published a scientific refutation. The members of the Council of the 
Biological Society of Washington who oversaw the publication of the 
journal insisted that they didn’t want to dignify it by responding. 

Eventually two scientists and a science education policy advocate— 
each associated with the National Center for Science Education, a group 
that lobbies for teaching evolution in the public schools—stepped 
forward. The three authors—geologist Alan Gishlick, education policy 
advocate Nicholas Matzke, and wildlife biologist Wesley R. Elsberry— 


published a response to my article on TalkReason.org, a prominent 
atheistic website. Although the website’s guidelines prohibit “ad 
hominem arguments,” the rule was somewhat loosely enforced in the 
case of Gishlick, Matzke, and Elsberry’s response, which they titled 
“Meyer’s Hopeless Monster.” 

Gishlick, Matzke, and Elsberry attempted to refute my central 
argument by citing a scientific paper that they said solved the problem of 
the origin of genetic information. The paper, a scientific review essay 
titled “The Origin of New Genes: Glimpses from the Young and Old,” 
had appeared in Nature Reviews Genetics in 2003. Gishlick, Matzke, and 
Elsberry asserted that this paper—coauthored by Manyuan Long, an 
evolutionary biologist at the University of Chicago, and several 
colleagues—was representative of an extensive “scientific literature 
documenting the origin of new genes.” 

Other biologists echoed Gishlick, Matzke, and Elsberry’s claim in the 
context of another public controversy. During the 2005 Kitzmiller v. 
Dover trial about an ill-advised attempt to require teachers in a 
Pennsylvania school district to read a statement about intelligent design, 
Brown University biologist Kenneth Miller cited Long’s paper in his 
testimony. He said that it shows how new genetic information evolves. 
The judge in the case, John E. Jones, then cited Miller’s testimony about 
Long’s article in his own decision. Judge Jones asserted there are “more 
than three dozen peer-reviewed scientific publications showing the 
origin of new genetic information by evolutionary processes.’”® 
Elsewhere Matzke, along with biologist Paul Gross, stated that the paper 
by Long “reviews all the mutational processes involved in the origin of 
new genes and then lists dozens of examples in which research groups 
have reconstructed the genes’ origins.”? In their view, “Competent 
scientists know how new genetic information arises.” 10 

But do evolutionary biologists really know this? 

Let’s take a closer look at the article that allegedly shows “how new 
genetic information arises.”!! 


ONCE UPON A GENE 


The oft-cited Long paper points to a variety of studies that purport to 
explain the evolution of various genes. These studies typically begin by 
taking a gene and then seeking to find other genes that are similar (or 
homologous) to it. They then seek to trace the history of slightly 
different homologous genes back to a hypothetical common ancestor 
gene (or genes). To do this, the studies survey databases of gene 
sequences looking for similar sequences in representatives of different 
taxonomic groups—often in closely related species. Some studies also 
attempt to establish the existence of a common ancestor gene on the 
basis of similar genes within the very same organism. They then 
typically propose evolutionary scenarios in which an ancestral gene 
duplicates itself! and then the duplicate and the original evolve 
differently as the result of subsequent mutations in each gene. 

Next, these scenarios invoke various kinds of mutations—duplication 
events, exon shuffling, retropositioning, lateral gene transfer, and 
subsequent point mutations—as well as the activity of natural selection 
(see Fig. 11.1). The evolutionary biologists conducting these studies 
postulate that modern genes arose as the result of these various 
mutational processes—processes that they envision as having shaped 
genes during a long evolutionary history. Since the information in 
modern genes is presumably different from the information in the 
hypothetical ancestor genes, they regard the mutational mechanisms that 
are allegedly responsible for these differences as the explanation for the 
origin of genetic information. 

Upon closer examination, however, none of these papers demonstrate 
how mutations and natural selection could find truly novel genes or 
proteins in sequence space in the first place; nor do they show that it is 
reasonably probable (or plausible) that these mechanisms would do so in 
the time available. These papers assume the existence of significant 
amounts of preexisting genetic information (indeed, many whole and 
unique genes) and then suggest various mechanisms that might have 
slightly altered or fused these genes together into larger composites. At 
best, these scenarios “trace” the history of preexisting genes, rather than 
explain the origin of the original genes themselves (see Fig. 11.2). 


This kind of scenario building can suggest potentially fruitful 
avenues of research. But an obvious error comes in mistaking a 
hypothetical scenario for either a demonstration of fact or an adequate 
explanation. None of the scenarios that the Long paper cites demonstrate 
the mathematical or experimental plausibility of the mutational 
mechanisms they assert as explanations for the origin of genes. Nor do 
they directly observe the presumed mutational processes in action. At 
best, they provide hypothetical, after-the-fact reconstructions of a few 
events out of a sequence of many supposed events, starting with the 
existence of a presumed common ancestor gene. But that gene itself 
does not represent a hard data point. It is inferred to have existed on the 
basis of the similarity of two or more other existing genes, which are the 
only actual pieces of observational evidence upon which these often 
elaborate scenarios are based. 
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FIGURE 11.1 


Various types of mutations that are alleged to result in the modification of genes: exon shuffling, 
retro-positioning, lateral gene transfer, and gene fusion. 
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FIGURE 11.2 

Depiction of how gene duplication and subsequent gene evolution might 
take place. While the gene on the bottom remains under selective 
pressure and cannot experience many mutations without loss of fitness 
or function (see Figure 10.3), the duplicated gene at the top can in theory 
vary without deleterious consequences to the organism. 


That these scenarios depend on various inferences and postulations 
doesn’t, by itself, disqualify them from consideration. Nevertheless, 
whether they adequately explain the origin of genetic information 
depends upon the evidence for the existence of the entities they infer 
(the ancestral genes) and the plausibility of the mutational mechanisms 
they postulate. Let’s look at both parts of these scenarios. 


COMMON ANCESTOR GENES? 


Nearly all of the scenarios developed in the papers that Long cites start 
with an inferred common ancestral gene from which two or more 
modern genes diverged and developed. These scenarios treat the 
similarity of sequence (the information) in two or more genes as 
unequivocal evidence for a common ancestral gene (see Fig. 11.2). As I 
noted in Chapters 5 and 6, standard methods of phylogenetic 
reconstruction presuppose, rather than demonstrate, that biological 


similarity results from shared ancestry. Yet, as we saw in Chapter 6, 
similarity of sequence by itself is not always an unequivocal indicator of 
common ancestry. Sometimes similarity appears between species where 
it cannot be explained by inheritance from a common ancestor (e.g., the 
similar forelimbs on moles and mole crickets) and, at the very least, 
there are other possible explanations for sequence similarity. 

In the first place, similar gene sequences might have evolved 
independently on two parallel lines of descent starting with two different 
genes, as the hypothesis of convergent evolution asserts. Recent 
examples of convergent genetic evolution now abound in the literature 
of molecular and evolutionary biology.!? For example, molecular 
biologists have discovered that both whales and bats use similar systems 
—involving similar genes and proteins—for echolocation. The striking 
similarity of these systems used in two otherwise disparate mammalian 
species has led biologists to posit the parallel evolution of echolocation, 
including the gene sequences and proteins that make it possible, from a 
common ancestor that did not possess this system. !4 

In addition, it is possible that similar genes might have been 
separately designed to meet similar functional needs in different 
organismal contexts. Viewed this way, similarity of sequence does not 
necessarily reflect descent with modification from a common ancestor, 
but could reflect design in accord with common functional 
considerations, constraints, or goals. I recognize, of course, that to this 
point I have not given any independent reasons for considering the 
design hypothesis, and that, as a hypothesis for sequence similarity by 
itself, intelligent design may not seem compelling. (For more compelling 
reasons to consider intelligent design, see Chapters 17 through 19.) 
Nevertheless, I mention both these other possible explanations for the 
similarity of gene sequences in order to demonstrate that sequence 
similarity does not necessarily indicate, or derive from, a common 
ancestral gene. 


ORFAN GENES 


Some genes, and the information-rich sequences they contain, most 
certainly cannot be explained by reference to the kind of scenarios that 
Long cites. All of these scenarios attempt to explain the origin of two 
similar genes by reference to descent with modification (via mutation) 
from common ancestral genes. Yet genomic studies are now turning up 
hundreds of thousands of genes in many diverse organisms that exhibit 
no significant similarity in sequence to any other known gene.!° These 
“taxonomically restricted genes” or “ORFans” (for “open reading frames 
of unknown origin”) now dot the phylogenetic landscape. ORFans have 
turned up in every major group of organisms, including plants and 
animals as well as both eukaryotic and prokaryotic one-celled 
organisms. In some organisms, as much as one-half of the entire genome 
comprises ORFan genes.!° 

Thus, even if it could be assumed that similar gene sequences always 
point to a common ancestor gene, these ORFan genes cannot be 
explained using the kind of scenarios that Long’s article cites. Since 
ORFans lack sequence similarity to any known gene—that is, they have 
no known homologs in even distantly related species—it is impossible to 
posit a common ancestral gene from which a particular ORFan and its 
homolog might have evolved. Remember: ORFans, by definition, have 
no homologs. These genes are unique—one of a kind—a fact tacitly 
acknowledged by the increasing number of evolutionary biologists who 
attempt to “explain” the origin of such genes through de novo (“out of 
nowhere”) origination. 

Some might argue that as biologists map the sequence of more 
genomes and add more gene sequences to protein databases, homologs 
of these ORFans will eventually turn up, thus gradually eliminating the 
mystery surrounding the ORFan phenomenon. Yet to date the trend has 
gone in the opposite direction. As scientists have explored and 
sequenced more genomes, they have discovered more and more ORFans 
without finding anything like a corresponding number of homologs. 
Instead, the number of “unpaired” ORFan genes continues to grow with 
no sign of the trend reversing itself. !” 


THE PLAUSIBILITY OF THE MUTATIONAL PROCESSES 


Even if evolutionary biologists could establish the existence of the 
common ancestral genes from which their scenarios begin, that would 
not establish the plausibility of a neo-Darwinian mechanism for 
generating genetic information from that ancestor. Moreover, the term 
“plausibility? in this context has a specific scientific and 
methodologically significant meaning. Studies in the philosophy of 
science show that successful explanations in historical sciences such as 
evolutionary biology need to provide “causally adequate” explanations 
—that is, explanations that cite a cause or mechanism capable of 
producing the effect in question. In On the Origin of Species, Darwin 
repeatedly attempted to show that his theory satisfied this criterion, 
which was then called the vera causa (or “true cause”) criterion. In the 
third chapter of the Origin, for example, he sought to demonstrate the 
causal adequacy of natural selection by drawing analogies between it 
and the power of animal breeding and by extrapolating from observed 
instances of small-scale evolutionary change over short periods of time. 

In this, Darwin hewed to a principle of scientific reasoning that one 
of his scientific role models, the great geologist Charles Lyell, used as a 
guide for reasoning about events in the remote past. Lyell insisted that 
good explanations for the origin of geological features should cite 
“causes now in operation’”—causes known from present experience to 
have the capacity to produce the effects under study.!® 

Do the scenarios developed by various evolutionary biologists cited 
in the Long review essay meet this criterion? Duplication mutations and 
various other modes of random mutational change along with natural 
selection clearly constitute “causes now in operation.” No one disputes 
that. But have these processes demonstrated the capacity to produce the 
effect in question, namely, the genetic information necessary to 
structural innovation in the history of life? There are several good 
reasons to think that they have not. 


BEGGING QUESTIONS 


First, most of the mutational processes that evolutionary biologists 
invoke in the scenarios cited in the Long essay presuppose significant 
amounts of preexisting genetic information on preexisting genes or 
modular sections of DNA or RNA. The Long essay highlights seven 
main mutational mechanisms at work in the sculpting of new genes: (1) 
exon shuffling, (2) gene duplication, (3) retropositioning of messenger 
RNA transcripts, (4) lateral gene transfer, (5) transfer of mobile genetic 
units or elements, (6) gene fission or fusion, and (7) de novo origination 
(see Fig. 11.1). Yet each of these mechanisms, with the exception of de 
novo generation, begins with preexisting genes or extensive sections of 
genetic text. This preexisting functionally specified information is in 
some cases enough to code for the construction of an entire protein or a 
distinct protein fold. Moreover, these scenarios not only assume 
unexplained preexisting sources of biological information, they do so 
without explaining or even attempting to explain how any of the 
mechanisms they envision could have solved the combinatorial search 
problem described in Chapters 9 and 10. 

A closer look at each of these mechanisms will show why scenarios 
that rely on them beg important questions about the origin of genetic 
information. 

Advocates of exon shuffling envision modular sections of a genome 
randomly arranging and rearranging themselves to generate entirely new 
genes, not unlike rearranging whole paragraphs in an essay to generate a 
new article. In genomes that have regions that code for the production of 
proteins interspersed with regions that do not code for proteins, the term 
“exon” refers to a protein-coding region of the genome. These protein- 
coding regions of the genome are often interrupted by nonprotein-coding 
sections of the genome (called introns) that serve other functions, such 
as coding for the production of regulatory RNAs. In any case, exons 
store significant quantities of preexisting functionally specified 
information. 

Though most proteins are encoded by multiple exons, a single exon 
may encode a substantial unit of protein structure, such as a functional 
protein fold—a fact that advocates of exon shuffling count in their own 


attempts to explain novel proteins. They assume that exons can be 
blindly shuffled and mixed around to form genes. Nevertheless, this 
mechanism cannot produce new protein folds. Either an exon is large 
enough that it already encodes a protein fold—in which case it’s not 
creating a new fold—or it’s too small, small enough that multiple exons 
must be combined in order to form a stable protein fold. In this latter 
case, other problems—in particular, something called adverse side-chain 
interaction—will preclude success, as we will see. 

Evolutionary scenarios envisioning other mutational mechanisms also 
presuppose important sources of preexistent genetic information. Gene 
duplication, as the name implies, involves the production of a duplicate 
copy of a preexisting gene, already rich in functionally specified 
information. Retropositioning of messenger RNA transcripts occurs 
when an enzyme called reverse transcriptase takes a preexisting strand 
of messenger RNA and inserts its corresponding DNA sequence into a 
genome, also producing a duplicate of the coding portion of a 
preexisting gene. Lateral gene transfer involves transferring a 
preexisting gene from one organism (usually a bacterium) into the 
genome of another. The transfer of mobile genetic elements likewise 
occurs when preexisting genes enclosed in circular strands of DNA 
called plasmids enter one organism from another and eventually find 
themselves incorporated into a new genome. This process also mainly 
occurs in single-celled organisms. A similar process can occur in 
eukaryotes, where mobile genetic elements called transposons—often 
called “jumping genes”!’—can hop from place to place in the genome. 
Gene fusion occurs when two adjacent preexisting genes, each rich with 
specified genetic information, link together after the deletion of 
intervening genetic material.”?° 

Each of these six mutational mechanisms presupposes preexisting 
modules of specified genetic information. Some of these mutational 
mechanisms also depend upon sophisticated preexistent molecular 
machines such as the reverse transcriptase enzyme used in 
retropositioning or other complex cellular machinery involved in DNA 
replication. Since building these machines requires other sources of 


genetic information, scenarios that presuppose the availability of such 
molecular machines to assist in the cutting, splicing, or positioning of 
modular sections of genetic information clearly beg the question. 

Overall, what evolutionary biologists have in mind is something like 
trying to produce a new book by copying the pages of an existing book 
(gene duplication, lateral gene transfer, and transfer of mobile genetic 
elements), rearranging blocks of text on each page (exon shuffling, 
retropositioning, and gene fusion), making random spelling changes to 
words in each block of text (point mutations), and then randomly 
rearranging the new pages. Clearly, such random rearrangements and 
changes will have no realistic chance of generating a literary 
masterpiece, let alone a coherent read. That is to say, these processes 
will not likely generate specificity of arrangement and sequence and, 
therefore, do not solve the combinatorial search problem. In any case, all 
such scenarios also beg the question. There is a big difference between 
shuffling and slightly altering preexisting sequence-specific modules of 
functional information and explaining how those modules came to 
possess information-rich sequences in the first place. 


EVOLUTION EX NIHILO? 


Long does cite at least one type of mutation that does not presuppose 
existing genetic information, the de novo origination of new genes. For 
example, one paper he discusses sought to explain the origin of a 
promoter region for a gene (the part of the gene that helps initiate the 
transcription of the gene’s instructions) and found that “this unusual 
regulatory region did not really ‘evolve.’ ” Instead, it somehow snapped 
into being: “It was aboriginal, created de novo by the fortuitous 
juxtaposition of suitable sequences.”*! 

Many other papers invoke de novo origination of genes. Long 
mentions, for example, a study seeking to explain the origin of an 
antifreeze protein in an Antarctic fish that cites “de novo amplification 
of a short DNA sequence to spawn a novel protein with a new 
function.”?” Likewise, Long cites an article in Science to explain the 


origin of two human genes involved in neurodevelopment that appealed 
to “de novo generation of building blocks—single genes or gene 
segments coding for protein domains,” where an exon spontaneously 
“originated from a unique noncoding sequence.”?? Other papers make 
similar appeals. A paper in 2009 reported “the de novo origin of at least 
three human protein-coding genes since the divergence with chimp[s],” 
where each of them “has no protein-coding homologs in any other 
genome.” An even more recent paper in PLoS Genetics reported “60 
new protein-coding genes that originated de novo on the human lineage 
since divergence from the chimpanzee,””° a finding that was called “a 
lot higher than a previous, admittedly conservative, estimate.””° 

Another 2009 paper in the journal Genome Research was 
appropriately titled “Darwinian Alchemy: Human Genes from 
Noncoding RNA.” It investigated the de novo origin of genes and 
acknowledged, “The emergence of complete, functional genes—with 
promoters, open reading frames (ORFs), and functional proteins—from 
‘junk’ DNA would seem highly improbable, almost like the elusive 
transmutation of lead into gold that was sought by medieval 
alchemists.””’ Nonetheless, the article asserted without saying how that: 
“evolution by natural selection can forge completely new functional 
elements from apparently nonfunctional DNA—the process by which 
molecular evolution turns lead into gold.”78 

The presence of unique gene sequences forces researchers to invoke 
de novo origin of genes more often than they would like. After one study 
of fruit flies reported that “as many as ~12% of newly emerged genes in 
the Drosophila melanogaster subgroup may have arisen de novo from 
noncoding DNA,”” the author went on to acknowledge that invoking 
this “mechanism” poses a severe problem for evolutionary theory, since 
it doesn’t really explain the origin of any of its “nontrivial requirements 
for functionality.”*° The author proposes that “preadaptation” might 
have played some role. But that adds nothing by way of explanation, 
since it only specifies when (before selection played a role) and where 
(in noncoding DNA), not how the genes in question first arose. Details 
about how the gene became “preadapted” for some future function is 


never explained. Indeed, evolutionary biologists typically use the term 
“de novo origination” to describe unexplained increases in genetic 
information; it does not refer to any known mutational process. 

Taking stock, then, many of the mutational processes that Long cites 
either: (1) beg the question as to the origin of the specified information 
contained in genes or parts of genes, or (2) invoke completely 
unexplained de novo jumps—essentially evolutionary creation ex nihilo 
(“from nothing”). 

Thus, ultimately, the scenarios featured in Long’s review essay do not 
explain the origin of the specified information in either genes or sections 
of genes. That would require a cause capable of solving the 
combinatorial inflation problem discussed in the previous chapters. But 
none of the scenarios discussed in Long’s article even addresses this 
problem, let alone demonstrates the mathematical plausibility of the 
mechanisms they cite. Yet, Gishlick, Matzke, and Elsberry originally 
cited Long as a definitive refutation of my article—the one in which I 
argued that the rarity of genes and proteins in sequence space cast doubt 
on the power of selection and mutations to generate novel genetic 
information. Professor Miller, in his testimony at the celebrated Dover 
trial, even convinced a federal judge to affirm that Long had succeeded 
in demonstrating how genetic information originates in a celebrated 
legal ruling. Clearly, one cannot solve a problem or refute an argument 
by failing to address it.°! 


PROTEIN FOLDS: PLAUSIBLE BUT IRRELEVANT 
SCENARIOS 


There is a second and closely related difficulty associated with the 
scenarios cited by Long. Typically, they do not even try to explain the 
origin of new protein folds, and few of them analyze genes different 
enough from each other that their protein products could, even 
conceivably, exemplify different folds. Instead, they usually attempt to 
explain the origin of homologous genes—genes that produce proteins 


with the same folded structure performing the same function or a closely 
related one. 

For example, Long cites one study comparing the two genes RNASE/ 
and RNASEIB, which code for homologous digestive enzymes.** The 
two proteins perform nearly the same function: breaking down RNA 
molecules in the digestive tracts of colobine leaf-eating monkeys, 
though each does its job at a slightly different optimal chemical pH. 
More important, given that the amino-acid sequences of the two 
enzymes are 93 percent identical, structural biologists would expect both 
enzymes to utilize the same protein fold to accomplish their closely 
related tasks. 

Long also references a study of a gene that codes for a histone 
protein, Cid, in two closely related species of fruit flies—Drosophila 
melanogaster and Drosophila simulans. The study didn’t try to explain 
the “origin” of the gene—it merely compared the gene in the two 
species, catalogued some minor differences between them, and asked 
how those differences arose. The study identified some two dozen 
nucleotide differences between the genes for Cid in the two species— 
only 17 of which might have changed an amino acid in the sequence out 
of 226 total amino acids in the Cid protein. Such a slight (7.5 percent) 
difference would be extremely unlikely to translate into different protein 
folds. Indeed, natural sequences known to have different folds do not 
have anything like the correspondingly high degree (92.5 percent) of 
sequence identity. Instead, known natural sequences with this high level 
of sequence identity have the same fold. 

Long also cites two studies of FOXP2, a gene involved in regulating 
gene expression in humans, chimps, other primates, and mammals. In 
humans and other mammals, this gene is involved in brain 
development.** Nevertheless, according to one study, the protein coded 
for by this gene in humans acquired just “two amino-acid changes on the 
human lineage”? during the entire course of its evolution from a 
common chimp-human ancestor—again, not likely a sufficient enough 
change to generate a new protein fold. 


The Long review essay cites numerous scenarios of this type— 
scenarios attempting to explain the evolution of slight gene variants (and 
their similar proteins), not the origin of new protein folds. This is an 
important distinction because, as we saw in Chapter 10, new protein 
folds represent the smallest unit of selectable structural innovation, and 
much larger structural innovations in the history of life depend upon 
them. Explaining the origin of structural innovation requires more than 
just explaining the origin of variant versions of the same gene and 
protein or even the origin of new genes capable of coding for new 
protein functions. It requires producing enough genetic information— 
truly novel genes—to produce new protein folds. 

Thus, even where these scenarios are plausible, they are not relevant 
to explaining the origin of the genetic information necessary to produce 
the kind of structural innovation that occurs in the Cambrian explosion 
(or many other events in the history of life). 


PROTEIN FOLDS: RELEVANT BUT IMPLAUSIBLE 
SCENARIOS 


In a few cases, the evolutionary scenarios cited in the Long paper appear 
to be attempts to explain genes that are different enough from each other 
that they could conceivably code for proteins with different folds. For 
example, Long discusses several papers that equate exon shuffling with 
the shuffling of protein domains. Recall that a protein domain is a stable 
“tertiary” protein structure or fold made of many smaller “secondary” 
structures such as alpha helices or beta strands (see Fig. 10.2). Many 
complex proteins have numerous domains, each exhibiting a unique fold 
or tertiary structure. One version of the exon-shuffling hypothesis 
assumes that each exon codes for a specific protein domain. It envisions 
random cutting and splicing—excising, shuffling, and recombination— 
of the exon portions of the genome, resulting in the modular 
rearrangement of genetic information. The resulting composite gene will 
then code for a new composite protein structure. As Long proposes, 
“Exon shuffling, which is also known as domain shuffling, often 


recombines sequences that encode various protein domains to create 
mosaic proteins.’””*° 

Of the mechanisms that Long discusses, exon shuffling (and the 
closely related idea of gene fusion) provides perhaps the most plausible 
means of generating new (composite) proteins.*’ Nevertheless, the idea 
that exon shuffling can explain the origin of the genetic information 
necessary to produce new protein folds or whole composite proteins is 
problematic for several reasons. 

First, the exon-shuffling hypothesis seems to assume that each exon 
involved in the process codes for a protein domain that folds into a 
distinctive tertiary structure. To a protein scientist, a protein domain is 
equivalent to a protein fold, though distinct protein structures (folds) 
may be composed of several smaller domains (or folds). Thus, at the 
very least, the exon-shuffling hypothesis presupposes the prior existence 
of a significant amount of genetic information—enough information to 
build an independent protein domain or fold. As such, it fails as an 
explanation for the origin of protein folds and the information necessary 
to produce them. 

Some advocates of exon shuffling, however, may be using the term 
“protein domain” in a slightly fuzzier way. They might be equating 
domains with smaller structural units such as fragments of a fold made 
of several units of secondary structure such as alpha helices or beta 
strands. Conceived this way, the exon-shuffling hypothesis would then 
entail the construction of a new protein structure by combining these 
smaller “fragments.” 

But in most cases, if the amino-acid chain that forms a domain is 
chopped into fragments, then the resulting isolated pieces would cease to 
retain their original shapes. Why? Because the three-dimensional shape 
of one small section of a protein is heavily dependent upon the overall 
structure and shape of the rest of the protein. Snip out a section or 
fragment, or synthesize a fragment in isolation from the rest of the 
protein, and a floppy amino-acid chain will result—one that has entirely 
lost its original shape, or ability to form a stable structure. Thus, this 
version of the exon-shuffling hypothesis lacks credibility because it 


incorrectly assumes that shapeless protein fragments can be mixed and 
matched in a modular fashion to form new stable, functional protein 
folds. Moreover, even if such shuffling were physically plausible, this 
version of the hypothesis would have another problem. It still 
presupposes unexplained functional information—in particular, the 
information necessary to specify, not just the smaller fragments, but also 
the information required to arrange these smaller units into stable folds, 
and ultimately functional proteins. 

Second, since the exon-shuffling hypothesis assumes that each exon 
involved in the shuffling codes for a specific protein domain, it also 
assumes that exon boundaries correspond to the boundaries of protein 
domains or folds. In existing genes, however, exon boundaries do not 
typically correspond to the boundaries of folded domains within the 
larger proteins.*® If the shuffling of exons explained how actual proteins 
had come into existence, then there should be a clear correspondence or 
correlation between exon boundaries within genes and the corresponding 
protein domains within larger composite structures (1.e., whole proteins). 
The absence of such a correspondence suggests that exon shuffling does 
not account for the origin of known compound protein structures. 

Third, relying on exon shuffling to cobble together a new protein fold 
from smaller units of protein structure is physically implausible for 
another reason. To see why, we need to examine what a “side chain” is. 
All twenty protein-forming amino acids have a common backbone 
(made of nitrogen, carbon, and oxygen), but each one has a different 
chemical group called a side chain sticking out at roughly right angles 
from that backbone. The interactions between side chains determine 
whether secondary units of protein structure made from chains of amino 
acids will fold into larger stable three-dimensional folds.*? Though many 
different sequences will generate secondary structures (alpha helices and 
beta strands), generating stable folds is much more difficult and requires 
much more specificity in the arrangement of amino acids and their side 
chains. Specifically, since the elements in smaller secondary structural 
units in proteins are surrounded by side chains, they cannot be combined 
into new folds unless the elements have the sequence specificity 


required for the side chains to complement one another.*? That means 
smaller secondary structural units will rarely*! fuse together to form 
stable tertiary structures or folds. Instead, attempts to form new folds 
from smaller units of structure repeatedly encounter adverse interactions 
between the side chains of the amino acids within units of secondary 
structure. 

The need for extreme specificity in the sequential arrangements of 
amino acids, discussed in the previous chapter, means that the 
overwhelming majority of amino-acid sequences in units of secondary 
structure will not result in stable folds as these units of structure come 
into contact with each other. As discussed in Chapter 10, the extreme 
rarity of functional proteins (with stable folds) in sequence space ensures 
that the probability of finding a correct fold-stabilizing sequence will be 
astonishingly small. For this reason, even skilled protein scientists have 
struggled to design sequences that will produce stable protein folds.“ 
Almost invariably the units of secondary structure that they attempt to 
combine or otherwise place into stable composite structures will not fold 
because of the interactions of their amino-acid side chains.** As 
molecular biologist Ann Gauger explains, “Thus, [alpha] helices and 
[beta] sheets are sequence-dependent structural elements within protein 
folds. You can’t swap them around like Lego bricks.” 

Nor is it an easy matter to simply find different sequences of amino 
acids that will stabilize folds from smaller secondary units of structure, 
again, because of the extreme rarity of functional (and folding) 
sequences within amino-acid sequence space. Generating specific 
sequences that will fold into stable structures, whether in the lab or 
during the history of life, requires solving the combinatorial inflation 
problem. Even small folds will require five or six units of secondary 
structure with 10 or so amino acids in each unit, that is, 60 or more 
precisely sequenced amino acids. Modest-size folds will require a dozen 
or more units of secondary structure and 150 to 200 specifically 
arranged amino acids in order to stabilize a fold. Larger protein folds 
will require many more secondary units and specifically arranged amino 
acids. Since, however, many mission-critical functions within even the 


simplest cell require many folds (of at least 150 amino acids) working in 
close coordination, the need to produce proteins of at least this length 
numerous times through the history of life cannot be avoided. 

All this requires searching for a functional needle in a vast haystack 
of combinatorial possibilities. Recall that Douglas Axe estimated the 
ratio of needles (functional sequences) to strands of straw in the 
haystack (nonfunctional sequences) to be 1 to 107’ for sequences of 
modest-length (150 amino acids). 

Of course, in naturally occurring proteins, the interactions between 
side chains in units of secondary structure do maintain stable folds. But 
these proteins, with their stable three-dimensional folded structures, 
depend upon exceedingly rare and precisely arranged sequences of 
amino acids. The question is not whether the combinatorial search 
problem necessary to produce stable protein folds has ever been solved, 
but whether a neo-Darwinian mechanism relying on random mutations 
(in this case random shuffling of exons) provides a plausible explanation 
for how it might have been solved. 

The papers that Long cites give no reason to think that exon shuffling 
(or any other mutational mechanism) has solved this problem. The exon- 
shuffling hypothesis ignores the need for side-chain specificity, though 
the need for such specificity has repeatedly defeated attempts in the 
laboratory to build new proteins from units of secondary structure in the 
manner required. 

But advocates of exon shuffling make no attempt to show how 
random rearrangements of protein domains—whether the domains are 
conceived of as fragments of a fold or whole folds—would solve the 
combinatorial problem. Nor do they challenge Sauer’s or Axe’s 
experimentally derived quantitative estimates of the rarity of functional 
genes or proteins. They do not challenge the probability calculations 
based on these estimates. And they do not show that a mechanism exists 
that can search amino-acid sequence spaces more effectively or 
efficiently than random mutation and selection. Nor do they demonstrate 
the efficacy of exon shuffling in a model system in the laboratory. 
Instead, basic considerations of protein structure imply the implausibility 


of exon shuffling as a means of generating the genetic information 
necessary to produce a new protein fold.*° So, in the end, with few 
words and with apparent confidence, advocates of exon shuffling simply 
assert, as the Long paper does, that “exon shuffling often recombines 
sequences that encode various protein domains to create mosaic 
proteins.” 


WORD SALAD 


The assertion of Long and his colleagues about exon shuffling, like 
many other statements about postulated mutational mechanisms, blurs 
the distinction between theory and evidence. Despite the authoritative 
tone of such statements, evolutionary biologists rarely directly observe 
the mutational processes they envision. Instead, they see patterns of 
similarities and differences in genes and then attribute them to the 
processes they postulate. Yet the papers that Long cites offer neither 
mathematical demonstration, nor experimental evidence, of the power of 
these mechanisms to produce significant gains in biological information. 

In the absence of such demonstrations, evolutionary biologists have 
taken to offering what one biologist I know calls “word salad”—,argon- 
laced descriptions of unobserved past events—some possible, perhaps, 
but none with the demonstrated capacity to generate the information 
necessary to produce novel forms of life. This genre of evolutionary 
literature envisions exons being “recruited’*° and/or “donated’*” from 
other genes or from an “unknown source’**; it appeals to “extensive 
refashioning’”? of genes; it attributes “fortuitous juxtaposition of 
suitable sequences”°? to mutations or “fortuitous acquisition”?! of 
promoter elements; it assumes that “radical change in the structure” of a 
gene is due to “rapid, adaptive evolution”; it asserts that “positive 
selection has played an important role in the evolution’”>’ of genes, even 
in cases when the function of the gene under study (and thus the trait 
being selected) is completely unknown; it imagines genes being 
“cobbled together from DNA of no related function (or no function at 


all)”;°° it assumes the “creation” of new exons “from a unique 


noncoding genomic sequence that fortuitously evolved”;>° it invokes 
“the chimeric fusion of two genes”;>’ it explains “near-identical’”>® 
proteins in disparate lineages as “a striking case of convergent 
evolution”;5? and when no source material for the evolution of a new 
gene can be identified, it asserts that “genes emerge and evolve very 
rapidly, generating copies that bear little similarity to their ancestral 
precursors” because they are apparently “hypermutable.”© Finally, 
when all else fails, scenarios invoke the “de novo origination” of new 
genes, as if that phrase—any more than the others just mentioned— 
constitutes a scientific demonstration of the power of mutational 
mechanisms to produce significant amounts of new genetic 
information.°! 

These vague narratives resemble nothing so much as the naming 
games of scholastic philosophers in the Middle Ages. Why does opium 
put people to sleep? Because it has a “dormative” virtue. What causes 
new genes to evolve so rapidly? Their “hypermutability” or perhaps 
their ability to undergo “rapid, adaptive evolution.” How do we explain 
the origin of two similar genes in two separate, but otherwise widely 
disparate lineages? Convergent evolution, of course. What is convergent 
evolution? The presence of two similar genes in two separate, but 
otherwise widely disparate lineages. How does convergent evolution 
occur, given the improbability of finding even one functional gene in 
sequence space, let alone the same gene arising twice independently? No 
one knows exactly, but perhaps it was a “fortuitous juxtaposition of 
suitable sequences,” or “positive selection,” or “de novo origination.” 
Need to explain two similar genes in more closely related lineages? Try 
“gene duplication,” or “chimerical gene fusion,” or “retropositioning,” 
or “extensive refashioning of the genome,” or some other scientific- 
sounding combination of words. 

The vagueness of these scenarios raises serious questions about how 
scientists could regard them as decisive demonstrations or refutations of 
anything—let alone refutations of the kind of experimentally based, 
mathematically precise challenges to mutation and selection described in 
the previous chapter. 


So despite the official pronouncement of a federal judge and claims 
of extensive “scientific literature documenting the origin of new genes,” 
evolutionary biologists have not demonstrated how new genetic 
information arises, at least not in amounts sufficient to build protein 
folds, the crucial units of biological innovation. Biologists have not 
solved the problem of combinatorial inflation or refuted the precise 
quantitative argument against the creative power of the selection and 
mutation mechanism presented in the previous chapter (or in my 2004 
article). Nor has anyone provided a compelling refutation of Douglas 
Axe’s assessment of the rarity of genes and proteins on which that 
argument is based. 

In fairness, neo-Darwinian biologists have mathematical models of 
their own—models indicating to them that nearly unlimited evolutionary 
change can occur under the right conditions. The assumption that these 
models, which are based on the equations of population genetics, 
accurately represent how much evolution can occur has left many 
evolutionary biologists confident in the creative power of various 
mutational mechanisms. But should they be? 

In the next chapter, I will take up this question. As I do, PII explain 
why evolutionary biologists have been, heretofore, untroubled by 
mathematical challenges to neo-Darwinism. [Il also show why that has 
begun to change as new developments in molecular genetics have 
introduced another formidable mathematical challenge to the creative 
power of the neo-Darwinian mechanism—a challenge that arises from 
within the neo-Darwinian framework and raises yet new questions about 
the causal adequacy of the neo-Darwinian mechanism. 
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Complex Adaptations and the Neo- 
Darwinian Math 


University of Illinois biologist Tom Frazzetta knew the textbook story as 
well as anyone. According to neo-Darwinian theory, organisms with all 
their complex systems came into existence via natural selection acting 
on randomly arising, small-scale variations and mutations. As Frazzetta 
understood, this evolutionary mechanism necessarily transforms 
organisms gradually, with modifications parceled into increments “as a 
sort of continuous change, where one structural condition melts 
gradually into another.”! 

Frazzetta had his doubts, however. As an expert in functional 
biomechanics—studying how animals actually work—he had dissected 
the skulls of rare snakes found only on the island of Mauritius, in the 
Indian Ocean. These snakes, called bolyerines, are boa-like but have an 
anatomical specialization found in no other vertebrate. Their maxilla, the 
tooth-bearing bone of the upper jaw, is divided into two segments, linked 
by a flexible joint and serviced by many specialized nerves, extra bones, 
tissues, and differently arranged ligaments. This unique trait allows the 
snakes to bend the front half of their upper jaw backwards when they 
attack prey (see Fig. 12.1). 

Could this complex system of bones, joints, tissues, and ligaments 
have evolved gradually? “A movable joint dividing the maxilla into two 
segments,” observed Frazzetta, “seems to have either a presence or 
absence, with no intermediate to connect the two conditions.” That is, 
either the maxilla occurs as one bone (as it does in every other 


vertebrate) or as two segments with all the accompanying joints, bones, 
ligaments, and tissues necessary to make it work, as it does in the 
bolyerine snakes. No intermediate condition—a broken maxilla with two 
pieces of bone lacking the necessary joints, tissues, and ligaments, for 
example—appears viable. As Stephen Jay Gould asked of the same 
system, “How can a jawbone be half broken?” Or as Frazzetta himself 
observed, “I thus find it difficult to envision a smooth transition from a 
single maxilla to the divided condition seen in bolyerines.”* Yet because 
the intermediate forms would not be viable, building a bolyerine jaw 
would require all the necessary parts—the jointed maxilla, the adjoining 
ligaments, and the necessary muscles and tissues—arising together. 

Yet the problem for neo-Darwinian theory, Frazzetta realized, 
extended well beyond the anatomical peculiarities of rare snakes. As a 
young evolutionary biology professor, he had studied complex features 
in a wide variety of species. He knew that almost any biological 
structure of interest—the inner ear, the amniotic egg, eyes, olfactory 
organs, gills, lungs, feathers, the reproductive, circulatory, and 
respiratory systems—possesses multiple necessary components. To 
change such systems requires altering each of the many independent 
parts upon which their functions are based. This cannot be done willy- 
nilly. For example, changing any of the three bones of the mammalian 
inner ear—the incus, stapes, or malleus—will perforce require 
corresponding changes in the other bones and in other parts of the ear as 
well, such as the tympanic membrane or the cochlea. Complex 
biological systems depend for their functions on tens or hundreds of 
such independent, yet jointly necessary parts. As the number of 
necessary components increases, the requisite number of coordinated 
changes increases too, rapidly driving up the difficulty of maintaining 
the functional integrity of the system while modifying any of its parts. 
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FIGURE 12.1 
A complex adaptation: the jointed upper jaw of the bolyerine snake, 
made possible by its accompanying tendons, ligaments, and 
musculature. The other skull shows the single-boned jaw, found in other 
related snakes. 


And that was the problem, as Frazzetta understood it. Any system 
that depends for its function on the coordinated action of many parts 
could not be changed gradually without losing function. But in the neo- 
Darwinian scheme of things, natural selection acts to preserve only 
functional advantages. Changes that result in death or reduced function 
will not be preserved. The integrated complexity of many biological 
systems thus imposes limitations on the evolutionary process— 
limitations that human engineers do not face when they design complex 
integrated systems. In 1975, Frazzetta wrote a minor classic entitled 
Complex Adaptations in Evolving Populations explaining this concern. 
He wrote: 


When modifying the design of a machine, an engineer is not bound 
by the need to maintain a real continuity between the first machine 
and the modification. . . . But in evolution, transitions from one 
type to the next presumably involve a greater continuity by means 
of a vast number of intermediate types. Not only must the end 
product—the final machine—be feasible, but so must be all the 
intermediates. The evolutionary problem is, in a real sense, the 
gradual improvement of a machine while it is running!” 


Historically, evolutionary biologists tried to solve this problem one 
advantageous variation or mutation at a time. Starting with Darwin 
himself, they have attempted to explain how natural selection and 
random variation could build complex systems as the result of a series of 
incremental changes, each of which might confer some selectable 
advantage. Darwin famously employed this strategy to explain the origin 
of the eye, asking his readers to imagine a series of incremental, 
advantageous changes to a simple “nerve sensitive to light.’ 

As Frazzetta thought about the problem of explaining the origin of 
complex systems, he came to doubt both the classical and the modern 
Darwinian accounts of such systems. Frazzetta acknowledged that he 
was influenced in part by the skepticism expressed by the Wistar 
“outsiders” (see Chapter 9). He admitted “revealing some hideous 
personalia” in confessing that he was attracted to the worries about neo- 
Darwinism expressed by Murray Eden and other Wistar skeptics.’ 

Frazzetta’s concerns about the adequacy of the neo-Darwinian 
mechanism, like Eden’s, turned on the growing appreciation of the 
nature and importance of genetic information. Though biologists then 
(as now) didn’t fully understand how genetic information in DNA 
correlates or “maps” to these higher-level complex morphological 
structures, by 1975 they did know that many hundreds of genes can be 
involved in coding for a single complex integrated structure. Thus, 
altering the anatomical structure of the mammalian ear or the vertebrate 
eye, for example, would involve altering the genes that code for its 
constituents, which implies, most implausibly, that multiple coordinated 
mutations would occur virtually simultaneously. 

As Frazzetta explained, “Phenotypic alteration of integrated systems 
requires an improbable coincidence of genetic (and hence, heritable 
phenotypic) modifications of a tightly specified kind.” Yet the extreme 
specificity of the fit of the components and the functional dependence of 
the whole system on this fit imply limits to allowable genetic change. 
Genetic change affecting any one of the necessary components, unless 
matched by many corresponding—and vastly improbable—genetic 
changes, will result in functional loss and often death. For this reason, as 


Frazzetta concluded, “We are still left with the unabating need to explain 
evolutionary changes in systems that have the operational integration 
characteristic of things we recognize as ‘machines.’ ”’ At the time, the 
doubts he expressed gained little traction in the evolutionary biology 
community, because neo-Darwinian evolutionary biologists assumed 
that mutation and selection had nearly unlimited creative power, enough 
to generate even complex systems of the kind described in Frazzetta’s 
book. 

The mathematical expression of neo-Darwinian theory, as represented 
in the equations of a subdiscipline of biology known as population 
genetics, seemed to confirm this conviction. Population genetics models 
how gene frequencies change as the result of processes such as mutation, 
genetic drift (neutral changes in the genome that natural selection neither 
favors nor eliminates), and natural selection. On the assumption that 
advantageous variations or traits will arise as the result of even single 
mutations, the mathematical models of population genetics describe how 
much evolutionary change can occur in a given period of time. These 
estimates are based upon, among others, three primary factors: mutation 
rates, effective population sizes, and generation times. When 
evolutionary biologists plug estimates for these factors into the equations 
of population genetics, their calculations seem to imply that standard 
evolutionary mechanisms could generate significant amounts of 
evolutionary change in many groups of organisms—even enough to 
build complex systems. As long as mutations generate a continuous 
supply of new traits, any system, however complex, can be built one trait 
at a time—trait upon trait—via the creative power of natural selection. 
Or so the story goes. 

Confidence in these mathematical models (and their underlying 
assumptions) led many neo-Darwinists to disregard the need to give 
detailed accounts of the specific evolutionary pathways by which 
complex systems might have arisen. For example, in an evolutionary 
biology text widely used about the time Frazzetta first posed this 
challenge, evolutionary biologists Paul Ehrlich and Richard Holm 
advised: 


One need not go into the details of the evolution of the bird s wing, 
the giraffes neck, the vertebrate eye, the nest building of some 
fish, etc., as the selective origins of these and other structures and 
of behavioral patterns may be assumed to be basically the same in 
outline as those, such as industrial melanism, which have already 
been discussed. Even a slight advantage or disadvantage in a 
particular genetic change provides a sufficient differential for the 


operation of natural selection." 


The phrase “sufficient differential for the operation of natural selection” 
refers to the equations of population genetics and one of the factors (the 
so-called selection coefficient) that determines how rapidly particular 
traits would be likely to disseminate through a population. The message 
was clear: the math tells the story; the biological details of the origin of 
complex systems don’t matter. 

The neo-Darwinian focus on mathematical modeling helps to explain 
why mainstream evolutionary biologists haven’t worried about the 
problem of the origin of new genes and proteins or the problem of 
combinatorial inflation, discussed in Chapters 9 and 10. Many 
contemporary evolutionary biologists, like the founders of population 
genetics, assumed that some mechanism for building new genes already 
existed. Indeed, they assumed that new traits (and the genes for building 
them) can arise as the result of even single mutations (or a series of such 
mutations that each confer a small, incremental, selectable advantage). 
Thus, the mathematical expression of neo-Darwinian theory seemed to 
certify the plausibility of even large-scale evolutionary changes—again, 
provided these changes could occur one mutation at a time. 

But what if there are systems in living organisms that cannot be built 
one mutation at a time, and instead must be built by simultaneous 
coordinated changes? What if building just a single new gene or protein 
requires such coordinated mutational changes? What if individual genes 
turned out to be complex adaptations? 

Mathematical challenges of the kind first advanced at Wistar, and that 
Douglas Axe’s experimental findings have exacerbated, initially did not 


dent confidence in the adequacy of neo-Darwinian explanations. Many 
evolutionary biologists have simply regarded mathematical challenges to 
the creative power of the mechanism, coming as they mostly do from 
scientists and engineers in other fields, as exotic or irrelevant. 

That has begun to change. And it has begun to change in a way that 
has not only introduced a new mathematical challenge to the creative 
power of the neo-Darwinian mechanism, but also in a way that indirectly 
confirms Axe’s key insight about the rarity of genes and proteins. In the 
last decade, developments in molecular genetics and population genetics 
have exposed a connection between the problem of the origin of new 
genes and proteins and the origin of complex adaptations, a connection 
first perceived by Tom Frazzetta back in 1975. As more biologists have 
recognized that connection, they too have begun to share Frazzetta’s 
doubt. 


POPULATION GENETICS AND THE ORIGIN OF GENETIC 
INFORMATION 


The neo-Darwinian synthesis was formulated during the 1930s before 
the elucidation of the structure of DNA. Biologists at that time did not 
yet understand the nature, structure, or precise location of genetic 
information.'' They did not associate genes with long strings of 
nucleotide bases along the spine of the DNA molecule. They did not 
think of genes as long sections of digital code stored in complex 
biomacromolecules. Instead, after Mendel, but before Watson and Crick, 
genes were defined operationally as those entities, associated with 
chromosomes, that produced specific visible or selectable anatomical 
traits, such as eye color or beak shape. 

The architects of neo-Darwinism working in the 1930s reformulated 
evolutionary theory to emphasize the importance of mutations as the 
source of genetic variation. It followed, therefore, that the mutations— 
which they regarded as the source of heritable variation—must operate 
on genes. Not knowing the nature of genes, they also assumed that a 


single mutation could alter a gene in such a way as to produce a new 
trait. 

The equations of population genetics are predicated upon this 
assumption. The rate of mutation thus emerges as an important factor in 
computing the amount of evolutionary change that can occur in any 
given population. If every individual mutation can produce a new, 
potentially selectable trait, then the rate at which such variation 
accumulates partially determines how much change can occur in a given 
time. 

After 1953, biologists no longer conceived of the gene as an abstract 
entity. Watson and Crick showed that the gene had a definite locus and 
structure and that individual genes contain hundreds or thousands of 
precisely sequenced nucleotide bases, each functioning as digital 
characters in a larger instruction set. Consequently, biologists changed 
their understanding of mutations as well. Biologists came to understand 
mutations as something like typographic errors in long strings of digital 
code. As a result, many scientists began to realize that individual 
mutations were unlikely by themselves to produce new beneficial traits. 
Some scientists realized that mutations were instead overwhelmingly 
more likely to degrade the information contained in a gene than to 
produce a new function or trait, and that the accumulation of mutations 
would eventually and typically result in the /oss of function. 

This change of perspective called for an explanation of how 
mutations could generate new genes—an explanation that was provided 
beginning in the 1970s with the ideas of gene duplication, subsequent 
neutral evolution, and positive selection. 

Though the theory of gene duplication played no formal role in the 
mathematical structure of population genetics, it did serve to buttress a 
critical assumption of the whole enterprise. After the 1950s, 
evolutionary biologists no longer assumed that single mutations would 
necessarily generate whole new traits. That left a critical assumption of 
population genetics essentially undefended. For many evolutionary 
biologists, the theory of gene duplication closed that conceptual gap. 
After the theory was formulated, many evolutionary biologists thought 


that a mechanism had been discovered by which sections of genetic text 
could accumulate multiple changes without compromising the fitness of 
an organism, thus ensuring the eventual production of new genes and a 
steady supply of new traits. 

So when Frazzetta confronted evolutionary biology with the problem 
of complex adaptations in the mid-1970s, most neo-Darwinian biologists 
responded with a collective yawn, if they noticed it at all. His challenge 
was no challenge after all. So implied the math of population genetics— 
provided its assumptions about the ease with which new mutations could 
generate new traits were valid. 

But were they valid? Could a series of separate mutations generate 
the new genes necessary to build new proteins and new traits, or did 
building genes require multiple coordinated mutations? 


ARE GENES COMPLEX ADAPTATIONS? 


Classically, Darwinian biologists have assumed that small, separate step- 
by-step changes could produce all biological structures and features, 
provided each change confers some survival or reproductive advantage. 
In his chapter in the 1909 anthology Darwin and Modern Science, the 
British geneticist William Bateson wryly described how this widespread 
assumption prevented evolutionary biologists from confronting the real 
difficulty of explaining the origin of complex adaptations: 


By suggesting that the steps through which an adaptive 
mechanism arises are indefinite and insensible, all further trouble 
is spared. While it could be said that species arise by an insensible 
and imperceptible process of variation, there was clearly no use in 
tiring ourselves by trying to perceive that process. This labour- 


saving counsel found great favor.” 


One of the first prominent evolutionary biologists to consider the 
possibility that building new genes and proteins might require multiple 
coordinated mutations was John Maynard Smith. Maynard Smith 


worked as an aeronautical engineer during World War II, but then took 
up the formal study of evolutionary biology after the war. He eventually 
helped to found the University of Sussex, where he also served as a 
distinguished professor of biology until the mid-1980s.!° 

In 1970, Maynard Smith wrote an article in Nature responding to an 
earlier article by Frank Salisbury, a biologist from Utah State University. 
Salisbury had raised questions about whether random mutations could 
explain the specificity of the arrangement of nucleotide bases necessary 
to produce functional proteins. Salisbury worried, following discussions 
at Wistar, that the probability of random mutations generating functional 
arrangements of bases or amino acids was prohibitively low. According 
to Salisbury’s calculations, “The mutational mechanism as presently 
imagined could fall short by hundreds of orders of magnitude of 
producing, in a mere four billion years, even a single required gene.”’!4 

To overcome this improbability, Maynard Smith proposed a model of 
protein evolution. While admitting that the origin of the first proteins 
remained a mystery, he suggested that one protein could evolve into 
another as the result of small incremental changes in amino-acid 
sequences, provided each sequence maintained some function at each 
step along the way. Maynard Smith compared protein-to-protein 
evolution to changing one letter in an English word in order to generate 
a different word (while at each step generating a different meaningful 
word). He used this example to convey how he thought protein evolution 
might work: 


WORD — WORE — GORE — GONE — GENE 
He explained: 
The words [in this analogy ] represent proteins, the letters 
represent amino acids, the alteration of a single letter corresponds 


to the simplest evolutionary step, the substitution of one amino 
acid for another, and the requirement of meaning corresponds to 


the requirement that each unit step in evolution should be from 
one functional protein to another.” 


As a self-professed “convinced Darwinist,” Maynard Smith realized 
that natural selection and random mutation could only build new 
biological structures from preexisting structures if each intermediate 
structure along the way conferred some adaptive advantage. He thought 
that this requirement applied as much to the evolution of new genes and 
proteins as it did to the evolution of new phenotypic traits or larger-scale 
anatomical structures.!° 

Nevertheless, the essentially digital or alphabetic character of the 
genetic information that directs protein synthesis suggested a problem to 
Maynard Smith. How, he asked, could one gene or protein evolve into 
another if such a transformation required multiple simultaneous changes 
in the bases of the genetic text (or arrangement of amino acids)? If 
building new genes required multiple coordinated mutations, then the 
probability of generating a new gene or protein would drop 
precipitously, since such a transformation would require not just one 
improbable mutational event, but two or three or more, occurring more 
or less at once. Here’s how he described the potential problem: 


Suppose that a protein ABCD . . . exists, and that a protein abCD 
... would be favoured by selection if it arose. Suppose further that 
the intermediates aBCD ... and AbCD . . . are nonfunctional. 
These forms would arise by mutation, but would usually be 
eliminated by selection before a second mutation could occur. The 


double step from abCD . . . to ABCD would thus be very unlikely 


to occur. 17 


In Maynard Smith’s view, the improbability associated with “double- 
step” or multiple-step coordinated mutations presented a significant 
potential problem for molecular evolution. In the end, however, he 
concluded that such mutations were so improbable that they must not 
have played a significant role in the evolution of novel structures. As he 


explained, “Such double steps . . . may occasionally occur, but are 
probably too rare to be important in evolution.” !® 

For several decades, the problem he flagged receded into obscurity. 
As biochemist H. Allen Orr pointed out in 2005 in the journal Nature 
Reviews Genetics, “Although Maynard Smith’s work appeared early in 
the molecular revolution,” his ideas about problems facing protein 
evolution “were almost entirely ignored for two decades.”!? Thus, Orr 
noted that evolutionary biologists stopped thinking about molecular 
evolution as a consequence of adaptive changes at the amino-acid level. 
Not until the first decade of the twenty-first century would biologists 
confront the challenge of making a rigorous quantitative analysis of the 
plausibility of protein-to-protein evolution. 


WAITING FOR COMPLEX ADAPTATIONS 


In 2004, Lehigh University biochemist Michael Behe (see Fig. 12.2), 
introduced briefly at the end of Chapter 9, and University of Pittsburgh 
physicist David Snoke published a paper in the journal Protein Science 
that returned to the problem first described by Maynard Smith.7° By this 
time, Behe had established himself as a prominent critic of neo- 
Darwinism by arguing that the neo-Darwinian mechanism did not 
provide an adequate explanation for the origin of functionally integrated 
“irreducibly complex” molecular machines. In his 2004 paper, Behe 
sought to extend his critique of neo-Darwinism by assessing its 
adequacy as an explanation for new genes and proteins. He and Snoke 
attempted to assess the plausibility of protein evolution in the case that it 
does indeed require multiple coordinated mutations. They applied 
standard neo-Darwinian modes of analysis derived from population 
genetics to make their evaluation. They considered the plausibility of the 
main neo-Darwinian model of gene evolution in which evolutionary 
biologists envision new genes arising by gene duplication and 
subsequent mutations in the duplicated gene. 

Behe and Snoke assessed the plausibility of this model for 
multicellular organisms in the case that multiple (two or more) point 


mutations must occur simultaneously in order to generate a new 
selectable gene or protein. Whereas Maynard Smith saw the need for 
multiple coordinated mutations as a potential problem, one that 
ultimately needn’t trouble evolutionary biologists, Behe and Snoke 
argued that evolutionary biologists do need to worry about it, and they 
quantified its severity. 

Behe and Snoke first noted that many proteins, as a condition of their 
function, require unique combinations of amino acids interacting in a 
coordinated way. For example, ligand binding sites on proteins—places 
where small molecules bind to large proteins to form larger functional 
complexes—typically require a combination of several amino acids. 
Behe and Snoke argued that in such cases the combinations of amino 
acids would have to arise in a coordinated fashion since the capacity for 
ligand binding depends on all the necessary amino acids being present 
together. In support of this inference, they cited an authoritative 
textbook, Molecular Evolution, by University of Chicago evolutionary 
biologist Wen-Hsiung Li. In it, Li notes that evolving ligand binding 
capacity in proteins such as hemoglobin may require “many mutational 
steps,””! even though the first steps on the way to building such capacity 
would confer no selective advantage. As Li explains, “Acquiring a new 
function may require many mutational steps, and a point that needs 
emphasis is that the early steps might have been selectively neutral [ non- 
advantageous] because the new function might not be manifested until a 
certain number of steps had already occurred.” 





FIGURE 12.2 
Michael Behe. Courtesy Laszlo Bencze. 


Behe and Snoke point out that this observation implies that a series of 
separate mutations could not generate a ligand binding function in a 
protein that previously did not have this capacity, since individual 
amino-acid changes would initially confer no selectable advantage on 
the protein lacking this function. Instead, evolving ligand binding 
capability would require multiple coordinated mutations. Behe and 
Snoke make a similar argument about the requirements for the evolution 
of protein-to-protein interactions. They note that for proteins to interact 
with each other in specific ways, typically at least several individually 
necessary amino acids must be present in combination in each protein, 
again, suggesting the need for multiple coordinated mutations. 


SO MANY CHANGES, SO LITTLE TIME 


Behe and Snoke used the principles of population genetics to assess the 
likelihood of various numbers of coordinated mutational changes 
occurring in a given period of time. They asked: Is it probable that there 
was enough time in evolutionary history to generate coordinated 
mutations? If so, how many coordinated mutations is it reasonable to 
expect in a period of time given various population sizes, mutation rates, 
and generation times? Then, for different combinations of these various 
factors, they assessed how long it would typically take to generate two 
or three or more coordinated mutations. They determined that generally 
the probability of multiple mutations arising in close (functionally 
relevant) coordination to each other was “prohibitively” low—it would 
likely take an immensely long time, typically far longer than the age of 
the earth. 


THE POWERBALL LOTTERY—POPULATION GENETICS 
MADE EASY 


Before going on, it might be helpful to understand a bit more about how 
the equations and principles of population genetics can be used to 
calculate what evolutionary biologists call “waiting times,” the expected 
time that it will take for a given trait to arise by various evolutionary 
processes. In his book The Edge of Evolution, Michael Behe illustrates 
these principles using a charming analogy to the Powerball lottery game 
that many American state governments use to raise money. 

To win at Powerball, contestants must purchase tickets with six 
numbers that match the numbers printed on six balls drawn from two 
drums. Five of the balls are selected from a drum containing 59 white 
balls, numbered 1 through 59. A sixth red ball, the so-called power ball, 
is chosen out of a drum of 35 red balls numbered 1 to 35. To win the 
jackpot—which can exceed $100 million—a player must purchase a 
ticket listing all six of the chosen numbers in any order. The Powerball 
website lists the probability of matching all six balls at roughly 1 in 175 
million. Depending on how many tickets have been purchased and how 
frequently drawings occur, it may take a very long time for someone to 
win. 

Behe asked his readers first to consider how long it will take, on 
average, to generate a lottery ticket with the winning numbers. He notes 
that knowing the probability of drawing such a winning ticket isn’t 
sufficient. The calculation also requires knowing how often drawings 
occur and how many tickets are sold. As Behe explains: “If the odds of 
winning are one in a hundred million, and if a million people play every 
time, then it will take on average about a hundred drawings for someone 
to win.” If there are about a hundred drawings per year, with a million 
people playing per drawing, “then it would take about a year before 
someone won. But if there were only one drawing per year, on average it 
would take a century to hit the jackpot.” More frequent drawings 
produce shorter waiting times. Less frequent drawings tend to require 
longer waiting times. Similarly, more players will decrease the average 
time necessary to produce a winner, while fewer players result in longer 
waits. 


Similar mathematical principles apply when calculating the expected 
waiting times for the evolution of biological features by mutation and 
selection. Biologists first need to assess the complexity of the system— 
or its inverse, the improbability of the feature occurring. As in 
Powerball, however, knowing the probability of an event by itself does 
not allow someone to calculate how long it will likely take for that event 
to occur. Such a calculation requires also knowing the size of the 
population (equivalent to how many people are playing Powerball) and 
how frequently new genetic sequences arise (equivalent to how 
frequently drawings are held). 

In Powerball, a new sequence of numbers arises in every drawing. 
But when organisms reproduce, they do not always generate a new 
sequence of nucleotide bases in their individual genes. For this reason, to 
calculate the rate at which new sequences arise in living organisms 
requires knowing two factors: the generation time and the mutation rate. 
More rapid rates of mutation and/or shorter generation times will 
increase the rate at which new genetic sequences arise, resulting in 
shorter waiting times. Slower rates of mutation and/or longer times 
between generations produce longer waiting times. Also, as in 
Powerball, the number of “players” is important. Larger populations 
generate new genetic sequences more frequently than smaller ones and, 
thus, decrease expected waiting times. Smaller populations reduce the 
rate at which new sequences are generated, increasing waiting times. 

Now, under the rules of Powerball, you can “win” without picking all 
six numbers correctly—you just won’t win the entire jackpot.” If you 
pick just the red “power ball” correctly, you win $4. Pick three white 
balls correctly, and you win $7. If you correctly pick the numbers of four 
white balls, you win $100. Guess all five white balls correctly (but not 
the red ball), and you can win a cool $1 million. 

With each additional ball necessary to secure a new level of 
winnings, the probability of winning decreases exponentially, while the 
values of the prizes increase dramatically. The Powerball website lists 
the probability of winning a $4 prize at just 1 in 55, the probability of 


winning $1 million dollars at roughly 1 in 5 million, and the probability 
of winning the jackpot as 1 in 175 million (see Fig. 12.3).7> 








NUMBER OF BALLS 
WHICH MUST BE ODDS OF WINNING PRIZE -> 
GUESSED CORRECTLY 
1 (RED) 1 IN 55.41 $4 
3 (ALL WHITE) 1 IN 360.14 $7 
4 (ALL WHITE) 1 IN 19,087.53 $100 
5 (4 WHITE, 1 RED) 1IN 648,975.96 $10,000 
5 (ALL WHITE) 1 IN 5,153,632.65 $1,000,000 


FIGURE 12.3 

A chart showing the probability of winning and the corresponding 
payouts for different combinations of balls in the Powerball lottery 
game. 


Neo-Darwinists have long assumed that biological evolution works 
something like matching one number in Powerball. In their view, natural 
selection acts to reward or preserve small but relatively probable 
changes in gene sequences—like winning the small but more likely $4 
prize in Powerball over and over again. They assume the mutation and 
selection mechanism doesn’t depend on winning extremely unlikely 
“prizes” (like the whole Powerball jackpot) all at once. 

But what if, to produce a functional advantage at the genetic level, 
the mutation and selection mechanism had to generate the biological 
equivalent of all six (or more) correct balls in the Powerball lottery with 
no reward for guessing a smaller number of balls correctly first? Clearly, 
the probability of this would be extremely small. And the waiting time 
for winning such a lottery could become prohibitively long. 


BACK TO THE BIOLOGY 


That brings us back to Behe and Snoke’s conclusion. In their 2004 paper, 
they argued that generating a single new protein will often require many 
improbable mutations occurring at once. They took into account the 


improbability of multiple functionally necessary mutations appearing 
together—the equivalent of needing to get a Powerball ticket matching 
several numbers to win any money at all. Then they sought to determine 
how long it would take and/or how large the population sizes would 
need to be to generate a new gene via multiple coordinated mutational 
changes—the genetic equivalent of the “jackpot scenario.” 

Behe and Snoke found that if generating a new gene required 
multiple coordinated mutations, then the waiting time would grow 
exponentially with each additional necessary mutational change. They 
also assessed how population sizes affected how long it would take to 
generate new genes, if multiple coordinated mutations were necessary to 
produce those genes. They found, not surprisingly, that just as larger 
populations diminished expected waiting times, smaller populations 
dramatically increased them. 

More important, they found that even if building a new gene required 
just two coordinated mutations, the neo-Darwinian mechanism would 
likely either require huge population sizes or extremely long waiting 
times or both. If coordinated mutations were necessary, then evolution at 
the genetic level faced a catch-22: for the standard neo-Darwinian 
mechanism to generate just two coordinated mutations, it typically either 
needed unreasonably long waiting times, times that exceeded the 
duration of life on earth, or it needed unreasonably large population 
sizes, populations exceeding the number of multicellular organisms that 
have ever lived. To get population sizes that were reasonable, they had 
to have waiting times that were unreasonable. To get waiting times that 
were reasonable, they had to have population sizes that were 
unreasonable. As they put it, either way the “numbers seem 
prohibitive.””° 

Behe and Snoke found that mutation and selection could generate two 
coordinated mutations in a mere | million generations, a reasonable 
length of time given the age of the earth. But that was only in a 
population of 1 trillion or more multicellular organisms, a number that 
exceeds the size of the effective breeding populations of practically all 


individual animal species that have lived at any given time” 


Conversely, they found that mutation and selection could generate two 
coordinated mutations in a population of only 1 million organisms, but 
only if the mechanism had 10 billion generations at its disposal. Yet on 
the assumption that each multicellular organism lived only one year, 10 
billion generations computes to 10 billion years—more than twice the 
age of the earth. This is clearly an unreasonable length of time to wait 
for the emergence of a single gene, let alone more significant 
evolutionary innovations. 

Behe and Snoke did, however, find one tiny “sweet spot” in which a 
gene requiring only two coordinated mutations could arise (see Fig. 
12.4). Such a gene could conceivably arise from 1 billion organisms in a 
“mere” 100 million generations. Since many more than 1 billion 
multicellular organisms have lived on earth during its history and since 
multicellular life on earth has existed for more than 500 million years, 
these numbers offer (assuming, again, one year per generation) the 
prospect of enough time and organisms to generate one new gene—if 
only two coordinated mutations are necessary. (Of course, if the 
population evolving a two-mutation trait had fewer than 1 billion 
organisms, then the waiting times again increased to unreasonable 
lengths.) 
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FIGURE 12.4 


This diagram shows the population sizes and times (measured in number 
of generations) required to produce a gene or trait if building that gene 
or trait requires multiple coordinated mutations. The shaded gray area 
shows the “sweet spot’”—population sizes and available time sufficient 
to generate the coordinated mutations necessary to produce a new gene. 
Note that any multi-mutation feature requiring more than two mutations 
could not, in all probability, evolve by gene duplication and subsequent 
coordinated mutation in a population of multicellular organisms, 
however large. Note also that for most normal population sizes and 
reasonable generation times, even evolving two mutations lies beyond 
the reach of gene duplication, mutation, and selection. Courtesy John 
Wiley and Sons and The Protein Society. 


Nevertheless, these numbers only apply to the case in which only two 
coordinated mutations are necessary to build a new gene. Behe and 
Snoke found that if generating a new functional gene or trait required 
more than two coordinated mutations, then excessively long waiting 
times were necessary regardless of the size of the population. If three or 
more coordinated mutations were necessary, their calculations generated 
no “sweet spots” at all. Thus, they concluded that “the mechanism of 
gene duplication and point mutation alone would be ineffective, at least 
for multicellular . . . species.”° 

In summary, Behe and Snoke applied the principles derived from 
population genetics to evaluate the creative power of the standard neo- 
Darwinian model of gene evolution. They showed that the standard 
model encounters clear probabilistic limits if the structures it needs to 
build require more than two coordinated mutations in multicellular 
eukaryotic organisms. 


THE EDGE OF EVOLUTION AND ITS CRITICS 


Behe and Snoke are well-known critics of the creative power of the neo- 
Darwinian mechanism, so their conclusion might seem suspect to some 
observers. Nevertheless, evolutionary biologists attempting to defend the 


creative power of the neo-Darwinian mechanism have inadvertently 
confirmed Behe and Snoke’s conclusions. 

Two recent scientific publications tell this story. First, in 2007, 
Michael Behe published a book, The Edge of Evolution, amplifying the 
results of his 2004 paper with David Snoke. Using public-health data 
about a genetic trait—resistance to the antimalarial drug chloroquine in 
the one-celled organism that causes malaria—Behe provided another 
line of evidence and argument to support the conclusion that multiple 
coordinated mutations are often necessary to produce even minor 
genetic adaptations. 

Based on public-health data, Behe determined that resistance to 
chloroquine only arises once in every 107° malaria-causing cells. Behe 
inferred, by working the problem backwards, that the trait probably 
required multiple—though not necessarily coordinated—mutations to 
develop. He called this trait a “chloroquine complexity cluster,” or a 
“CCC.”*? Behe wanted to explore what he called the “edge of 
evolution,” the limits to the creative power of mutation and selection at 
the genetic level. Having established that this trait could arise by random 
mutation in a reasonably short period of time, he wondered how much 
time would be required to produce traits of greater complexity in 
populations of various sizes. 

He asked his readers to consider a hypothetical genetic trait twice as 
complex as a CCC cluster—a feature requiring the origin of two 
coordinated traits, each as complex as a CCC. In other words, Behe 
wondered how long it would take to develop a hypothetical trait that 
required two genetic changes as complex as a chloroquine complexity 
cluster, if both changes had to occur together in the same organism—in a 
coordinated fashion—in order to produce the trait. He then showed, 
using the principles of population genetics, that multi-mutation traits of 
that complexity—the molecular equivalent of two coordinated CCCs— 
would require many more organisms or vastly more time than was 
reasonable given the history of life. Remember the Powerball lottery: 
waiting times increase exponentially with each additional coordinated 
change or winning element needed. Behe showed, for example, that if 


10? organisms were required to obtain one CCC, then the square of that 
amount—10*° organisms—would be required to evolve a trait that 
required two coordinated CCCs before providing any advantage.°” But, 
as we saw in Chapter 10, only 10*° total organisms have ever existed on 
earth, implying that the entire history of the earth would barely provide 
enough opportunities to generate a trait of this complexity.*! 

Similarly, Behe reasoned that for organisms in smaller population 
sizes, developing a trait of twice the complexity of a CCC would require 
immensely long waiting times. He also determined that exceedingly long 
waiting times are typically required to generate even less complex 
genetic adaptations in smaller populations. 

Behe showed that the problem of coordinated mutations was 
particularly acute for longer-lived organisms with small population sizes 
—organisms such as mammals or, more specifically, human beings and 
their presumed prehuman ancestors. Behe estimated, based upon 
relevant mutation rates, known human population sizes, and generation 
times, the time required for two coordinated mutations to occur in the 
hominid line. He calculated that producing even such a modest 
evolutionary change would require many hundreds of millions of years. 
Yet, humans and chimps are thought to have diverged from a common 
ancestor only 6 million years ago. Behe’s calculation implied that the 
neo-Darwinian mechanism does not have the capacity to generate even 
two coordinated mutations in the time available for human evolution— 
and thus does not explain how humans arose. 

Here the story gets really interesting. Soon after the publication of 
The Edge of Evolution, two Cornell University mathematical biologists, 
Rick Durrett and Deena Schmidt, both defenders of neo-Darwinism, 
attempted to refute Behe’s conclusion by making their own calculations. 
Their paper, “Waiting for Two Mutations: With Applications to 
Regulatory Sequence Evolution and the Limits of Darwinian Evolution,” 
also applied a model based upon population genetics to calculate the 
amount of time necessary to generate two coordinated mutations in the 
hominid line. Although they calculated a shorter waiting time then Behe 
did, their result nevertheless underscored the implausibility of relying on 


the neo-Darwinian mechanism to generate coordinated mutations during 
the relevant evolutionary timescale. Their calculation suggested that it 
would take not several hundred million years, but “only” 216 million 
years to generate and fix two coordinated mutations in the hominid line 
—more than thirty times the amount of time available to produce 
humans and chimps and all their distinctive complex adaptations and 
differences from their inferred common ancestor. 

In seeking to refute Behe, Durrett and Schmidt inadvertently 
confirmed his main contention. As they acknowledged, their calculation 
implies that generating two or more coordinated mutations is “very 
unlikely to occur on a reasonable timescale.”?? In sum, calculations 
performed by both critics and defenders of neo-Darwinian evolution 
now reinforce the same conclusion: if coordinated mutations are 
necessary to generate new genes and proteins, then the neo-Darwinian 
math itself, as expressed in the principles of population genetics, 
establishes the implausibility of the neo-Darwinian mechanism. 


TESTING THE CO-OPTION OPTION 


But does generating novel genes and proteins require coordinated 
mutations? Behe and Snoke inferred as much based upon an undisputed 
fact of molecular biology: many proteins rely on sets of amino acids 
acting in close coordination in order to perform their functions. In 
addition, in The Edge of Evolution, Behe argued on functional grounds 
that many complex biological systems would require coordinated 
adaptive mutations since in these systems, the absence of even one or a 
few gene products (proteins or traits) will cause them to lose function. 
Behe specifically showed that several molecular machines within cells 
(such as the cilium and intraflagellar transport system, and the bacterial 
flagellar motor>*) require the coordinated interaction of multiple protein 
parts in order to maintain their function. Nevertheless, in making this 
argument, Behe did not address an alternative idea about the pathway by 
which new genes and proteins might have evolved, and thus did not 


establish conclusively that new genes and proteins themselves represent 
complex adaptations. 

Some neo-Darwinists have proposed a model of protein evolution 
known as “co-option.” In this model, a protein that performs one 
function is transformed, or “co-opted” to perform some other function. 
This model envisions new features requiring multiple “mutations” 
arising in a step-by-step way to produce some protein, call it “Protein 
B,” from some other protein that lacked those features, call it “Protein 
A.” In proposing a series of single separate mutations, advocates of co- 
option acknowledge that the initial individual amino-acid changes, the 
first few steps in evolution, from Protein A, the protein lacking the 
multisite feature, would not allow Protein A to perform the function of 
Protein B. Nevertheless, they propose that these initial changes might 
have allowed Protein A to perform some other advantageous function, 
thus making it selectable and preventing protein evolution from 
terminating due to diminution or loss of its initial function. Eventually, 
as mutations continued to generate new proteins with slightly different 
functions, they would have generated a protein close enough in sequence 
and structure that just one or a very few additional changes would 
suffice to convert it into Protein B. 

Aware of these imaginative scenarios, Douglas Axe and his 
colleague, molecular biologist Ann Gauger (see Fig. 12.5), now working 
together at the Biologic Institute in Seattle, decided to put them to an 
ingenious experimental test.** In so doing, they sought to determine 
whether the evolution of new multisite features does indeed typically 
require multiple coordinated mutations, or instead whether such a 
feature could arise by co-option. 





FIGURE 12.5 
Ann Gauger. Courtesy Laszlo Bencze. 


Axe and Gauger scoured protein databases looking for proteins that 
are as similar as possible in sequence and structure, but that nevertheless 
perform different functions. They identified two proteins that meet those 
criteria. One of these proteins (Kbl,) is needed for breaking down an 
amino acid called threonine, and the other (BioF,) is needed for building 


a vitamin called biotin. (See Fig. 12.6.) 

Gauger and Axe realized that if they could transform Kbl, into a 
protein performing the function of BioF, with just one or very few 
coordinated amino-acid changes, then that might demonstrate 
(depending upon how few) that the two proteins were close enough in 
sequence that a conversion in function of the kind envisioned by co- 
option advocates is plausible in evolutionary time. What’s more, because 
they knew the difficulty scientists have had in showing any real change 
of protein function to be feasible, a positive result would suggest that 


they had at last discovered a functional gap that one or very few 
mutations could plausibly jump—as co-option envisioned. 

If, however, they found that many coordinated mutational changes 
were needed, then that could establish—depending upon how many were 
needed—that the Darwinian mechanism could not accomplish the 
functional jump from A to B in a reasonable time. That would imply that 
an even greater degree of structural similarity between proteins would be 
needed for the co-option hypothesis to be plausible. Having carefully 
examined the structural similarities between members of a large class of 
structurally similar enzymes, they knew that Kbl, and BioF, were about 


as close in sequence and structure as any two known proteins that 
performed different functions. Thus, if it turned out that converting one 
protein function into the other required many coordinated mutations— 
more than could be expected to occur in a reasonable time—then the 
outcome of their experiment would have devastating implications for 
standard accounts of protein evolution. If proteins that perform two 
different functions have to be even more similar than Kbl, and BioF, in 
order for mutational changes to convert the function of one to the other, 
then for all practical purposes co-option would not work. There simply 
aren’t many known jumps that small. 
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FIGURE 12.6 
The proteins, Kbl, (left) and BioF, (right) are enzymes that use similar 


catalytic mechanisms to accelerate different chemical reactions in the 
bacterium E. coli. Courtesy Ann Gauger and Douglas Axe. 


Axe and Gauger first identified those amino-acid sites that were most 
likely, if mutated, to cause a change from Kbl, function to BioF, 


function. They then systematically mutated those sites individually and 
in groups involving various amino-acid combinations. Their results were 
unambiguous. They found that they could not induce, with either one or 
a small number of amino acids, the change in function they sought. In 
fact, they found that they could not get Kbl, to perform the function of 


BioF,, even if they mutated larger numbers of amino acids in concert— 


that is, even if they made many more coordinated mutations than could 
plausibly occur by chance in all of evolutionary history. 
Although their attempts to convert Kbl, to perform the function of 


BioF, failed, their experiment did not. It allowed them to establish 


experimentally for the first time that the co-option hypothesis of protein 
evolution lacks credibility—simply too many coordinated mutations 
would be required to convert one protein function to another, even in the 
case of extremely similar proteins. That implied that generating new 
genes and proteins would require multiple coordinated mutations, and 
thus, the waiting times that Behe and Snoke had calculated do present a 
problem for neo-Darwinian theory. 

The experimental work also enabled Axe to calculate expected 
waiting times for various numbers of coordinated mutations given 
different variables and factors. Axe developed a refined population- 
genetics mathematical model to calculate various waiting times. His 
results roughly confirmed the previous calculations of Behe and Snoke. 
He found, for example, that if he took into account the probable fitness 
cost to an organism of carrying unnecessary gene duplicates (as was 
necessary to give the evolution of a new gene a reasonable chance), that 
the probable waiting time for even three coordinated mutations exceeded 
the duration of life on earth. 

He therefore effectively determined an upper bound of two for the 
number of coordinated mutations that could be expected to occur in a 
duplicate gene during the history of life on earth (taking into account the 
negative effects of carrying gene duplicates in the evolutionary process). 


He also calculated six coordinated mutations as an upper bound, 
neglecting the fitness cost of carrying gene duplicates. Nevertheless, in 
their experiments, he and Gauger could not induce a functional change 
in a single gene with more than six coordinated mutations. So, even that 
more generous—and, again, unrealistically generous upper bound—does 
little to render the co-option hypothesis credible. Indeed, Axe and 
Gauger’s experiments showed that the smallest realistically conceivable 
step exceeded what was plausible given the time available to the 
evolutionary process. In their words, “evolutionary innovations requiring 
that many changes . . . would be extraordinarily rare, becoming probable 
only on timescales much longer than the age of life on earth.” 


WHAT IT ALL MEANS 


By showing the implausibility of the co-option model of protein 
evolution and the need for multiple coordinated mutations in order to 
generate multisite features in proteins, Axe and Gauger confirmed that 
genes and proteins themselves represent complex adaptations—entities 
that depend upon the coordinated interaction of multiple subunits that 
must arise as a group to confer any functional advantage. 

The need for coordinated mutations means that evolutionary 
biologists cannot just assume that mutations will readily generate new 
genes and traits, as neo-Darwinists have long presupposed. Indeed, by 
applying mathematical models based on the standard principles of 
population genetics to the questions of the origin of genes themselves, 
Behe and Snoke, Durrett and Schmidt (inadvertently), Axe and Gauger, 
and other biologists*> have recently shown that generating the number of 
multiple coordinated mutations needed to produce even one new gene or 
protein is unlikely to occur within a realistic waiting time. Thus, these 
biologists establish the implausibility of the neo-Darwinian mechanism 
as a means of generating new genetic information. 

There is one other aspect of this. The body of work published 
between 2004 and 2011 also provides additional confirmation of Axe’s 
research showing the rarity of genes and proteins in sequence space. In 


fact, that research helps to explain why such long waiting times are 
necessary. If functional sequences are rare in sequence space, it stands to 
reason that finding them by purely random and undirected means will 
take a long time. Moreover, waiting times increase exponentially with 
each additional necessary mutation. Thus, long waiting times for the 
production of new functional genes and proteins is exactly what we 
should expect if indeed functional genes and proteins are rare, and if 
coordinated mutations are necessary to produce them. Thus, the various 
experiments and calculations performed between 2004 and 2011 
indirectly confirm Axe’s earlier conclusion about the rarity of functional 
genes and proteins and supply further evidence that the neo-Darwinian 
mechanism cannot generate the information necessary to build new 
genes, let alone a new form of animal life, in the time available to the 
evolutionary process. 


THE MATH AND THE MECHANISM 


There is a concluding irony in all this. The researchers calculating 
waiting times for the appearance of complex adaptations have in each 
case done so using models based on the core principles of population 
genetics, the mathematical expression of neo-Darwinian theory. In a real 
sense, therefore, the neo-Darwinian math is itself showing that the neo- 
Darwinian mechanism cannot build complex adaptations—including the 
new information-rich genes and proteins that would have been necessary 
to build the Cambrian animals. To adapt a metaphor that Tom Frazzetta 
might appreciate, the snake has eaten its own tail. 


13 
The Origin of Body Plans 


Rarely have the implications of a Nobel Prize-winning scientific 
discovery received so little notice. Of course, the discovery itself 
received great acclaim. But the deeper meaning was another matter. 

Starting in the autumn of 1979, at the European Molecular Biology 
Laboratory in Heidelberg, two venturesome young geneticists, 
Christiane Nusslein-Volhard and Eric Wieschaus (see Fig. 13.1), 
generated thousands of mutations to investigate the genomes of tens of 
thousands of fruit flies (species: Drosophila melanogaster). They hoped 
to get them to divulge the secrets of embryological development. In 
technical jargon, Nusslein-Volhard and Wieschaus performed 
“saturation mutagenesis” experiments. After feeding male flies the 
potent mutation-causing chemical (i.e., mutagen) ethyl methane 
sulphonate (EMS), Niisslein-Volhard and Wieschaus bred the males 
with virgin females. They then examined the offspring larvae for visible 
defects. 

In generating many thousands of mutants, thereby “saturating” the 
Drosophila genome, Nusslein-Volhard and Wieschaus induced 
variations in the small subset of genes that specifically regulate 
embryonic development. These regulatory genes normally control the 
expression of many other genes that build the fly embryo, progressively 
subdividing it into regions that will become the head, thorax, and 
abdomen of the adult fly. The EMS mutagen disrupts DNA replication, 
thereby mutating genes. These mutations affect the process of 
development, leaving visible defects in the fly larvae. By observing the 
damaged larvae, Wieschaus and Nusslein-Volhard inferred how specific 


genes regulate the development of different parts of the fly body plan. 
In essence, Wieschaus and Nusslein-Volhard reverse-engineered the 
fly’s genome to determine the function of the different genes, including 
the regulatory genes crucial to fly development. ! 

The thoroughness and novelty of the “Heidelberg screens” (as the 
experiments came to be known) and their importance for revealing the 
mechanisms of regulatory control during animal embryogenesis won the 
attention of the Nobel committee. In 1995, the committee awarded the 
Nobel Prize in Medicine or Physiology to Nusslein-Volhard and 
Wieschaus. “This work was revolutionary,” University of Cambridge 
geneticist Daniel St. Johnston explained, “because it was the first 
mutagenesis in any multicellular organism that attempted to find most 
or all of the mutations that affect . . . the essential patterning genes that 
are used throughout development.” 

That’s the story as it is usually told. And it’s correct, as far as it goes. 
But the mutant fruit flies obtained by Nisslein-Volhard and Wieschaus 
tell another story—one less widely known, but one containing important 
clues for the unsolved mystery of the origin of animal body plans. 

Wieschaus himself alluded to these clues in a memorable interaction 
at the 1982 meeting of the American Association for the Advancement 
of Science (AAAS). After a session on the processes of macroevolution 
in which Wieschaus had presented a paper, one audience member asked 
him what he meant by the term “strong” as he used it to describe the 
mutations he and Niusslein-Volhard had induced in flies. Wieschaus 
explained with a laugh that “strong” certainly did not mean alive. 
Without exception, the mutants he studied perished as deformed larvae 
long before achieving reproductive age. “No, dead is dead,” he joked, 
“and you can’t be more dead.”? 

Another questioner then asked Wieschaus about the implications of 
his findings for evolutionary theory. Here Wieschaus responded more 
soberly, wondering aloud about whether his collection of mutants 
offered any insights into how the evolutionary process could have 
constructed novel body plans. “The problem is, we think we’ve hit all 
the genes required to specify the body plan of Drosophila,” he said, 


“and yet these results are obviously not promising as raw materials for 
macroevolution. The next question then, I guess, is what are—or what 
would be—the right mutations for major evolutionary change? And we 
don’t know the answer to that.’* 

Thirty years later, developmental and evolutionary biologists still 
don’t know the answer to that question. At the same time, mutagenesis 
experiments—on fruit flies as well as on other organisms such as 
nematodes (roundworms), mice, frogs, and sea urchins—have raised 
troubling questions about the role of mutations in the origin of animal 
body plans. If mutating the genes that regulate body-plan construction 
destroy animal forms as they develop from an embryonic state, then 
how do mutations and selection build animal body plans in the first 
place? 

The neo-Darwinian mechanism has failed to explain the generation 
of new genes and proteins needed for building the new animal forms 
that arose in the Cambrian explosion. But even if mutation and selection 
could generate fundamentally new genes and proteins, a more 
formidable problem remains. To build a new animal and establish its 
body plan, proteins need to be organized into higher-level structures. In 
other words, once new proteins arise, something must arrange them to 
play their parts in distinctive cell types. These distinctive cell types 
must, in turn, be organized to form distinctive tissues, organs, and body 
plans. This process of organization occurs during embryological 
development. Thus, to explain how animals are actually built from 
smaller protein components, scientists must understand the process of 
embryological development. 
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FIGURE 13.1 
Figure 13.la (left): Christiane Nüsslein-Volhard. Courtesy Wikimedia Commons, user Rama. 
Figure 13.1b (right): Eric F. Wieschaus. 





THE ROLE OF GENES AND PROTEINS IN ANIMAL 
DEVELOPMENT 


As much as any other subdiscipline of biology, developmental biology 
has raised disquieting questions for neo-Darwinism. Developmental 
biology describes the processes, called ontogeny, by which embryos 
develop into mature organisms. Within the past three decades the field 
has dramatically advanced our understanding of how body plans arise 
during ontogeny. Much of this new knowledge has come from studying 
so-called model systems—organisms that biologists can easily mutate in 
the lab, such as the fruit fly Drosophila and the nematode 
Caenorhabditis elegans. 

Although the exact details of animal development can vary in 
bewildering ways depending on the species, all animal development 
exemplifies a common imperative: start with one cell, end with many 
different cells. In most animal species, development begins with the 
fertilized egg. Once the egg divides into its daughter cells, becoming an 
embryo, the organism begins heading toward a well-defined target, 
namely, an adult form that can reproduce. Arriving at that distant target 
requires the embryo to produce many specialized cell types, in the 
correct positions and at the right time. 

Cell differentiation involves coordinating the expression of specific 
genes in space and time, as the number of cells, taking on their different 


roles, rises from one to two to four to eight, doubling and doubling until 
it reaches thousands, millions, and even trillions, depending on the 
species. The number of cell divisions and the total number of cells 
reflects the number of different cell types the adult needs. This in turn 
requires producing different proteins for different cell types. 

For example, the specialized digestive proteins that service the cells 
lining the adult intestine differ from proteins expressed in a neuron 
found in the nerve tract of a limb. They must differ because each 
performs dramatically different functions. So, during development, the 
appropriate genes must be turned on, or “up-regulated,” and turned off, 
or “down-regulated,” to ensure the production of the correct protein 
products at the right time and in the right cell types. 

Specific proteins play active roles in regulating the expression of 
genes for building other proteins. The protein actors playing these 
coordinating roles are known as transcriptional regulators (TRs) or 
transcription factors (TFs). TRs (or TFs) usually bind directly to 
specific sites in DNA, either preventing (repressing) or enabling 
(activating) the transcription of specific genes into RNA. TRs or TFs 
convey instructions about which genes to turn on or turn off. Their 
three-dimensional geometries exhibit characteristic DNA-binding 
features, including a specific domain of 61 amino acids that wraps 
around the DNA double helix. Other transcription factors include the 
zinc finger and leucine zipper motifs that also bind to DNA. 
Transcriptional regulators and factors are themselves controlled by 
complex circuits and signals transmitted by other genes and proteins, 
the overall complexity and precision of which is breathtaking. 

Painstaking genetic research—performed by Nusslein-Volhard and 
Wieschaus and many other developmental biologists-—has uncovered 
many of the key embryonic regulatory genes that help switch cells into 
their differentiated adult types. This research also uncovered a profound 
difficulty cutting to the very core of the neo-Darwinian view of life. 


EARLY-ACTING BODY-PLAN MUTATIONS AND EMBRYONIC 
LETHALS 


To create significant changes in the forms of animals requires attention 
to timing. Mutations in genes expressed late in the development of an 
animal will affect relatively few cells and architectural features. That’s 
because by late in development the basic outlines of the body plan have 
already been established.° Late-acting mutations therefore cannot cause 
any significant or heritable changes in the form or body plan of the 
whole animal. Mutations that are expressed early in development, 
however, may affect many cells and could conceivably produce 
significant changes in the form or body plan, especially if these changes 
occur in key regulatory genes.’ Thus, mutations that are expressed early 
in the development of animals have probably the only realistic chance 
of producing large-scale macroevolutionary change. As evolutionary 
geneticists Bernard John and George Miklos explain, 
“macroevolutionary change” requires changes in “very early 
embryogenesis.”” Former Yale University evolutionary biologist Keith 
Thomson concurs: only mutations expressed early in the development 
of organisms can produce large-scale macroevolutionary change. !° 

Yet from the first experiments by geneticist T. H. Morgan 
systematically mutating fruit flies early in the twentieth century until 
today, as many model species have been subjected to mutagenesis, 
developmental biology has shown that mutations affecting body-plan 
formation expressed early in development inevitably damage the 
organism.'! (See Fig. 13.2, for examples.) As one of the founders of 
neo-Darwinism geneticist R. A. Fisher noted, such mutations are “either 
definitely pathological (most often lethal) in their effects,” or they result 
in an organism that cannot survive “in the wild state.” 1? 

Normal development in any animal can be represented as an 
expanding network of decisions, where the earliest (upstream) decisions 
have greater impact than those occurring later. Regulatory genes and 
their DNA-binding protein products help to control this unfolding 
network of decisions—such that if regulatory proteins are altered or 
destroyed by mutation, the effects cascade downstream into the whole 
developmental process. The earlier the failure, the more widespread the 


destruction. Geneticist Bruce Wallace explains why early-acting 
mutations are thus overwhelmingly likely to disrupt animal 
development. “The extreme difficulty encountered,” he observes, “when 
attempting to transform one organism into another . . . still functional 
one lies in the difficulty in resetting a number of the many controlling 
switches in a manner that still allows for the individual’s orderly 
(somatic) development.” !3 

Nüsslein-Volhard and Wieschaus discovered this problem in 
experiments performed on fruit flies after their first Nobel Prize— 
winning efforts. In these later experiments they studied protein 
molecules that influence the organization of different types of cells early 
in the process of embryological development. These molecules, called 
“morphogens,” including one called Bicoid, are critical to establishing 
the fruit fly’s head-to-tail axis. They found that when these early-acting, 
body-plan-affecting molecules are perturbed, development shuts down. 
When mutations occur in the gene that codes for Bicoid, the resulting 
embryos die!4—as they do in all other known cases in which mutations 
occur early in the regulatory genes that affect body-plan formation. 

There are good functional reasons for this, familiar to us from the 
logic of other complex systems. If an automaker modifies a car’s paint 
color or seat covers, nothing else needs to be altered for the car to 
operate, because the normal function of the car does not depend upon 
these features. But if an engineer changes the length of the piston rods 
in the car’s engine, and does not modify the crankshaft accordingly, the 
engine won’t run. Similarly, animal development is a tightly integrated 
process in which various proteins and cell structures depend upon each 
other for their function, and later events depend crucially on earlier 
events. As a result, one change early in the development of an animal 
will require a host of other coordinated changes in separate but 
functionally interrelated developmental processes and entities 
downstream.! This tight functional integration helps explain why 
mutations early in development inevitably result in embryonic death and 
why even mutations expressed somewhat later in development 
commonly leave organisms crippled. 


Looking more closely at a specific experimental result of this kind 
further illuminates the problem. A mutation in the regulatory 
Ultrabithorax gene (expressed midway in the development of a fly) 
produces an extra pair of wings on a normally two-winged creature. 
Although an extra set of wings may sound like a useful piece of 
equipment, it’s not at all. This “innovation” results in a crippled insect 
that cannot fly because it lacks, among other things, a musculature to 
support the use of its new wings. Because the developmental mutation 
was not accompanied by the many other coordinated developmental 
changes that were needed to make the wings useful, the mutation is 
decidedly harmful. 





THE "SHORT WINGS" MUTANT THE "CURLY WINGS" MUTANT 
CANNOT FLY CANNOT FLY 





THE "EYELESS" MUTANT THE "ANTENNAPEDIA" MUTANTS 
IŞ COMPLETELY BLIND HAVE LEGS WHERE THEIR 
ANTENNAE SHOULD BE AND 
CANNOT REPRODUCE. 
FIGURE 13.2 


Examples of deleterious macromutations produced by experiments on 
fruit flies, including the “short wings,” “curly wings,” “eyeless,” and 
Antennapedia mutants. 
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This problem has led to what Georgia Tech geneticist John F. 
McDonald has called a “great Darwinian paradox.”!® He notes that the 
genes that are obviously variable within natural populations seem to 
affect only minor aspects of form and function—while those genes that 


govern major changes, the very stuff of macroevolution, apparently do 
not vary or vary only to the detriment of the organism. As he puts it, 
“Those [genetic] /oci that are obviously variable within natural 
populations do not seem to lie at the basis of many major adaptive 
changes, while those /oci that seemingly do constitute the foundation of 
many if not most major adaptive changes are not variable within natural 
populations.”!” In other words, the kind of mutations the evolutionary 
process would need to produce new animal body plans—namely, 
beneficial regulatory changes expressed early in development—don’t 
occur. Whereas, the kind that it doesn’t need—viable genetic mutations 
in DNA generally expressed late in development—do occur. Or put 
more succinctly, the kind of mutations we need for major evolutionary 
change we don’t get; the kind we get we don’t need. 

My Discovery Institute colleague Paul Nelson (see Fig. 13.3), a 
philosopher of biology who specializes in evolutionary theory and 
developmental biology, summarizes the challenge to neo-Darwinism 
posed by animal development as three premises: 


1. Animal body plans are built in each generation by a stepwise 
process, from the fertilized egg to the many cells of the adult. The 
earliest stages in this process determine what follows. 

2. Thus, to evolve any body plan, mutations expressed early in 
development must occur, must be viable, and must be stably 
transmitted to offspring. 

3. Such early-acting mutations of global effect on animal development, 
however, are those /east likely to be tolerated by the embryo and, in 
fact, never have been tolerated in any animals that developmental 
biologists have studied. 





FIGURE 13.3 
Paul Nelson. Courtesy Paul Nelson. 


Nelson came to appreciate the depth of the problem posed by these 
facts after many years of discussion with two members of his University 
of Chicago Ph.D. committee, evolutionary biologist Leigh Van Valen 
(1935-2010) and evolutionary theorist and philosopher of biology 
William Wimsatt. Van Valen, famous for his “Red Queen hypothesis” 
about the need for organisms to continue to evolve in order to maintain 
fitness, was passionately interested in the mechanisms of 
macroevolution. Wimsatt originated the theory of “generative 
entrenchment,” an account of the “causal asymmetries” at work in 
complex systems, including those responsible for animal 
development.'® Both acknowledged to Nelson that the scientific 
literature offers no examples of viable mutations affecting early animal 
development and body-plan formation (Premise 3, on previous page) 
and also that the macroevolution of novel animal form requires just such 
early-acting mutations (Premise 2, on previous page). Nevertheless, 
both Van Valen and Wimsatt remained committed to the descent of 
animal forms from a common ancestor via some kind of undirected 


mutations. Nelson argues, however, that those premises strongly imply 
that the neo-Darwinian mechanism does not—and indeed cannot— 
provide an adequate mechanism for producing new animal body plans. 
As he has told me: “If the only kind of mutations that can conceivably 
produce enough morphological change to alter whole body plans never 
causes beneficial and heritable changes, then it is difficult to see how 
mutation and selection could ever produce new body plans in the first 
place.”!° 
Thus, he concludes: 


Research on animal development and macroevolution over the 
last thirty years—research done from within the neo-Darwinian 
framework—has shown that the neo-Darwinian explanation for 
the origin of new body plans is overwhelmingly likely to be false 
—and for reasons that Darwin himself would have understood. 


Indeed, Darwin himself insisted that “nothing can be effected” by 
natural selection, “unless favorable variations occur.”2° Or as Swedish 
evolutionary biologist Soren Løvtrup succinctly explains: “Without 
variation, no selection; without selection, no evolution. This assertion is 
based on logic of the simplest kind. . . . Selection pressure as an 
evolutionary agent becomes void of sense unless the availability of the 
proper mutations is assumed.””! Yet the “proper” kind of mutations— 
the mutations that produce favorable changes to early-acting, body- 
plan—shaping, regulatory genes—do not occur. 

Microevolutionary change is insufficient; macromutations—large- 
scale changes—are harmful. This paradox has beset Darwinism from its 
inception, but discoveries about the genetic regulation of development 
in animals have made this paradox more acute and cast serious doubt on 
the efficacy of the modern neo-Darwinian mechanism as an explanation 
for the new body plans that arise in the Cambrian period. 


DEVELOPMENTAL GENE REGULATORY NETWORKS 


Another line of research in developmental biology has revealed a related 
challenge to the creative power of the neo-Darwinian mechanism. 
Developmental biologists have discovered that many gene products 
(proteins and RNAs) needed for the development of specific animal 
body plans transmit signals that influence the way individual cells 
develop and differentiate themselves. Additionally, these signals affect 
how cells are organized and interact with each other during 
embryological development. These signaling molecules influence each 
other to form circuits or networks of coordinated interaction, much like 
integrated circuits on a circuitboard. For example, exactly when a 
signaling molecule gets transmitted often depends upon when a signal 
from another molecule is received, which in tum affects the 
transmission of still others—all of which are coordinated and integrated 
to perform specific time-critical functions. The coordination and 
integration of these signaling molecules in cells ensures the proper 
differentiation and organization of distinct cell types during the 
development of an animal body plan. Consequently, just as mutating an 
individual regulatory gene early in the development of an animal will 
inevitably shut down development, so too will mutations or alterations 
in the whole network of interacting signaling molecules destroy a 
developing embryo. 

No biologist has explored the regulatory logic of animal 
development more deeply than Eric Davidson, at the California Institute 
of Technology. Early in his career, collaborating with molecular 
biologist Roy Britten, Davidson formulated a theory of “gene regulation 
for higher cells.”?* By “higher cells” Davidson and Britten meant the 
differentiated, or specialized, cells found in any animal after the earliest 
stages of embryological development. Davidson observed that the cells 
of an individual animal, no matter how varied in form or function, 
“generally contain identical genomes.”*? During the life cycle of an 
organism, the genomes of these specialized cells express only a small 
fraction of their DNA at any given time and produce different RNAs as 
a result. These facts strongly suggest that some animal-wide system of 
genetic control functions to turn specific genes on and off as needed 


throughout the life of the organism—and that such a system functions 
during the development of an animal from egg to adult as different cell 
types are being constructed. 

When they proposed their theory in 1969, Britten and Davidson 
acknowledged that “little is known . . . of the molecular mechanisms by 
which gene expression is controlled in differentiated cells.”74 
Nevertheless, they deduced that such a system must be at work. Given: 
(1) that tens or hundreds of specialized cell types arise during the 
development of animals, and (2) that each cell contains the same 
genome, they reasoned (3) that some control system must determine 
which genes are expressed in different cells at different times to ensure 
the differentiation of different cell types from each other—some system- 
wide regulatory logic must oversee and coordinate the expression of the 
genome.” 

Davidson has dedicated his career to discovering and describing the 
mechanisms by which these systems of gene regulation and control 
work during embryological development. During the last two decades, 
research in genomics has revealed that nonprotein-coding regions of the 
genome control and regulate the timing of the expression of the protein- 
coding regions of the genome. Davidson has shown that the nonprotein- 
coding regions of DNA that regulate and control gene expression and 
the protein-coding regions of the genome together function as circuits. 
These circuits, which Davidson calls “developmental gene regulatory 
networks” (or dGRNs) control the embryological development of 
animals. 
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FIGURE 13.4 

Developmental gene regulatory networks (dGRNs) and development in 
the purple sea urchin, Strongylocentrotus purpuratus. Figure 13.4a (top, 
left): shows the actual embryo, starting at 6 hours and progressing 
through cell division to 55 hours, when the larval skeleton appears. 
Figure 13.4b (bottom, left): depicts the major classes of genes involved 
in specifying the larval skeleton. Figure 13.4c (top, right): shows the 
detailed genetic circuitry implicated in the overall “gene regulatory 
network” (“GRN”) that controls the construction of the larval skeleton. 
Courtesy National Academy of Sciences, U.S.A. 


On arriving at Caltech in 1971, Davidson chose the purple sea 
urchin, Strongylocentrotus purpuratus, as his experimental model 
system. The biology of S. purpuratus makes it an attractive laboratory 
subject: the species occurs abundantly along the Pacific coast, produces 
enormous quantities of easily fertilized eggs in the lab, and lives for 
many years.*° Davidson and his coworkers pioneered the technology 
and experimental protocols required to dissect the sea urchin’s genetic 
regulatory system. 

The remarkable complexity of what they found needs to be depicted 
visually. Figure 13.4a shows the urchin embryo as it appears six hours 
after development has begun (top left of diagram). This is the 16-cell 
stage, meaning that four rounds of cell division have already occurred (1 
— 2 — 4 — 8 — 16). As development proceeds in the next four stages, 
both the number of cells and the degree of cellular specialization 
increases, until, at 55 hours, elements of the urchin skeleton come into 
focus. Figure 13.4b shows, corresponding to these drawings of embryo 
development, a schematic diagram with the major classes of genes (for 
cell and tissue types) represented as boxes, linked by control arrows. 
Last, Figure 13.4c shows what Davidson calls “the genetic circuitry” 
that turns on the specific biomineralization genes that produce the 
structural proteins needed to build the urchin skeleton.”’ 

This last diagram represents a developmental gene regulatory 
network (or dGRN), an integrated network of protein and RNA- 


signaling molecules responsible for the differentiation and arrangement 
of the specialized cells that establish the rigid skeleton of the sea urchin. 
Notice that, to express the biomineralization genes that produce 
structural proteins that make the skeleton, genes far upstream, activated 
many hours earlier in development, must first play their role. 

This process does not happen fortuitously in the sea urchin but via 
highly regulated and precise control systems, as it does in all animals. 
Indeed, even one of the simplest animals, the worm C. elegans, 
possessing just over 1,000 cells as an adult, is constructed during 
development by dGRNs of remarkable precision and complexity. In all 
animals, the various dGRNs direct what Davidson describes as the 
embryo’s “progressive increase in complexity”—an increase, he writes, 
that can be measured in “informational terms.”7° 

Davidson notes that, once established, the complexity of the dGRNs 
as integrated circuits makes them stubbornly resistant to mutational 
change—a point he has stressed in nearly every publication on the topic 
over the past fifteen years. “In the sea urchin embryo,” he points out, 


“disarming any one of these subcircuits produces some abnormality in 
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expression. 

Developmental gene regulatory networks resist mutational change 
because they are organized hierarchically. This means that some 
developmental gene regulatory networks control other gene regulatory 
networks, while some influence only the individual genes and proteins 
under their control. At the center of this regulatory hierarchy are the 
regulatory networks that specify the axis and global form of the animal 
body plan during development. These dGRNs cannot vary without 
causing catastrophic effects to the organism. 

Indeed, there are no examples of these deeply entrenched, 
functionally critical circuits varying at all. At the periphery of the 
hierarchy are gene regulatory networks that specify the arrangements 
for smaller-scale features that can sometimes vary. Yet, to produce a 
new body plan requires altering the axis and global form of the animal. 
This requires mutating the very circuits that do not vary without 
catastrophic effects. As Davidson emphasizes, mutations affecting the 


dGRNs that regulate body-plan development lead to “catastrophic loss 
of the body part or loss of viability altogether.”°° He explains in more 
detail: 


There is always an observable consequence if a dGRN subcircuit 
is interrupted. Since these consequences are always 
catastrophically bad, flexibility is minimal, and since the 
subcircuits are all interconnected, the whole network partakes of 
the quality that there is only one way for things to work. And 
indeed the embryos of each species develop in only one way.*! 


ENGINEERING CONSTRAINTS 


Davidson’s findings present a profound challenge to the adequacy of the 
neo-Darwinian mechanism. Building a new animal body plan requires 
not just new genes and proteins, but new dGRNs. But to build a new 
dGRN from a preexisting dGRN by mutation and selection necessarily 
requires altering the preexisting developmental gene regulatory network 
(the very kind of change that, as we saw in Chapter 12, cannot arise 
without multiple coordinated mutations). In any case, Davidson’s work 
has also shown that such alterations inevitably have catastrophic 
consequences. 

Davidson’s work highlights a profound contradiction between the 
neo-Darwinian account of how new animal body plans are built and one 
of the most basic principles of engineering—the principle of constraints. 
Engineers have long understood that the more functionally integrated a 
system is, the more difficult it is to change any part of it without 
damaging or destroying the system as a whole. Davidson’s work 
confirms that this principle applies to developing organisms in spades. 
The system of gene regulation that controls animal-body-plan 
development is exquisitely integrated, so that significant alterations in 
these gene regulatory networks inevitably damage or destroy the 
developing animal.?? But given this, how could a new animal body plan, 


and the new dGRNs necessary to produce it, ever evolve gradually via 
mutation and selection from a preexisting body plan and set of dGRNs? 

Davidson makes clear that no one really knows: “contrary to 
classical evolution theory, the processes that drive the small changes 
observed as species diverge cannot be taken as models for the evolution 
of the body plans of animals.”*> He elaborates: 


Neo-Darwinian evolution . . . assumes that all process works the 
same way, so that evolution of enzymes or flower colors can be 
used as current proxies for study of evolution of the body plan. It 
erroneously assumes that change in protein-coding sequence is 
the basic cause of change in developmental program; and it 
erroneously assumes that evolutionary change in body-plan 
morphology occurs by a continuous process. All of these 
assumptions are basically counterfactual. This cannot be 
surprising, since the neo-Darwinian synthesis from which these 
ideas stem was a premolecular biology concoction focused on 
population genetics and . . . natural history, neither of which have 
any direct mechanistic import for the genomic regulatory systems 
that drive embryonic development of the body plan.** 


NOW AND THEN 


Eric Davidson’s work, like that of Nisslein-Volhard and Wieschaus, 
highlights a difficulty of obvious relevance to the Cambrian explosion. 
Typically, paleontologists understand the Cambrian explosion as the 
geologically sudden appearance of new forms of animal life. Building 
these forms requires new developmental programs—including both new 
early-acting regulatory genes and new developmental gene regulatory 
networks. Yet if neither early-acting regulatory genes nor dGRNs can be 
altered by mutation without destroying existing developmental 
programs (and thus animal form), then mutating these entities will leave 
natural selection with nothing favorable to select and the evolution of 
animal form will, at that point, terminate. 


Darwin’s doubt about the Cambrian explosion centered on the 
problem of missing fossil intermediates. Not only have those forms not 
been found, but the Cambrian explosion itself illustrates a profound 
engineering problem that fossil evidence does not address—the problem 
of building a new form of animal life by gradually transforming one 
tightly integrated system of genetic components and their products into 
another. Yet, in the next chapter, we will see that an even more 
formidable problem remains. 


14 
The Epigenetic Revolution 


In 1924, two German scientists, Hans Spemann and Hilda Mangold, 
reported an intriguing experiment, the significance of which could not 
have been fully appreciated at the time, three decades before the 
discovery of the information-bearing properties of DNA. Using 
microsurgery, Spemann and Mangold excised a portion of a newt 
embryo and transplanted that portion into another developing newt 
embryo.! 

They achieved a startling result. The second embryo produced two 
bodies, each with a head and tail, joined together at the belly, not unlike 
Siamese twins. Yet despite dramatically altering the anatomy of the 
embryo, Spemann and Mangold did not alter its DNA. 

Their experiment later suggested a radical possibility: that something 
in addition to DNA profoundly influences the development of animal 
body plans. Other experiments suggested as much. In the 1930s and 
1940s, American biologist Ethel Harvey showed experimentally that sea 
urchin embryos could undergo development up to about 500 cells after 
removal of their nuclei—in other words, without their nuclear DNA.” In 
the 1960s, Belgian scientists chemically blocked the transcription of 
DNA into RNA in amphibian embryos and found that the embryos could 
still develop to the point of containing several thousand cells. In the 
1970s, Canadian biologists showed that a frog embryo could undergo 
early development without its nucleus if the cell division apparatus from 
a sea urchin was injected into the egg.* 


None of these results indicate that embryos can develop fully without 
DNA. In every case, DNA was eventually necessary to complete 
embryological development. Yet these results suggest that DNA is not 
the whole story, that other sources of information are playing important 
roles in directing at least the early stages of animal development. 


ABOVE AND BEYOND: EPIGENETIC INFORMATION 


In 2003, MIT Press published a groundbreaking collection of scientific 
essays titled Origination of Organismal Form: Beyond the Gene in 
Developmental and Evolutionary Biology, edited by two distinguished 
developmental and evolutionary biologists, Gerd Müller, of the 
University of Vienna, and Stuart Newman, of New York Medical 
College. In their volume, Miller and Newman included a number of 
scientific articles describing recent discoveries in genetics and 
developmental biology—discoveries suggesting that genes alone do not 
determine the three-dimensional form and structure of an animal. 
Instead, many of the scientists in their volume reported that so-called 
epigenetic information—information stored in cell structures, but not in 
DNA sequences—plays a crucial role. The Greek prefix epi means 
“above” or “beyond,” so epigenetics refers to a source of information 
that lies beyond the genes. As Miiller and Newman explain in their 
introduction, “Detailed information at the level of the gene does not 
serve to explain form.”” Instead, as Newman explains, “epigenetic” or 
“contextual information” plays a crucial role in the formation of animal 
“body assemblies” during embryological development.° 

Miller and Newman not only highlighted the importance of 
epigenetic information to the formation of body plans during 
development; they also argued that it must have played a similarly 
important role in the origin and evolution of animal body plans in the 
first place. They concluded that recent discoveries about the role of 
epigenetic information in animal development pose a formidable 
challenge to the standard neo-Darwinian account of the origin of these 
body plans—perhaps the most formidable of all. 


In the introductory essay to their volume, Muller and Newman list a 
number of “open questions” in evolutionary biology, including the 
question of the origin of Cambrian-era animal body plans and the origin 
of organismal form generally, the latter being the central topic of their 
book. They note that though “the neo-Darwinian paradigm still 
represents the central explanatory framework of evolution,” it has “no 
theory of the generative.”’ In their view, neo-Darwinism “completely 
avoids [the question of] the origination of phenotypic traits and of 
organismal form.” As they and others in their volume maintain, neo- 
Darwinism lacks an explanation for the origin of organismal form 
precisely because it cannot explain the origin of epigenetic information. 

I first learned about the problem of epigenetic information and the 
Spemann and Mangold experiment while driving to a private meeting of 
Darwin-doubting scientists on the central coast of California in 1993. In 
the car with me was Jonathan Wells (see Fig. 14.1), who was then 
finishing a Ph.D. in developmental biology at the University of 
California at Berkeley. Like some others in his field, Wells had come to 
reject the (exclusively) “gene-centric” view of animal development and 
to recognize the importance of nongenetic sources of information. 

By that time, I had studied many questions and challenges to standard 
evolutionary theories arising out of molecular biology. But epigenetics 
was new to me. On our drive, I asked Wells why developmental biology 
was so important to evolutionary theory and to assessing neo- 
Darwinism. Ill never forget his reply. “Because” he said, “that’s where 
the whole theory is going to unravel.” 





FIGURE 14.1 
Jonathan Wells. Courtesy Laszlo Bencze. 


In the years since, Wells has developed a powerful argument against 
the adequacy of the neo-Darwinian mechanism as an explanation for the 
origin of animal body plans. His argument turns on the importance of 
epigenetic information to animal development. To see why epigenetic 
information poses an additional challenge to neo-Darwinism and what 
exactly biologists mean by “epigenetic” information, let’s examine the 
relationship between biological form and biological information. 


FORM AND INFORMATION 


Biologists typically define “form” as a distinctive shape and 
arrangement of body parts. Organismal forms exist in three spatial 
dimensions and arise in time—in the case of animals during 
development from embryo to adult. Animal form arises as material 
constituents are constrained to establish specific arrangements with an 
identifiable three-dimensional shape or “topography’—one that we 
would recognize as the body plan of a particular type of animal. A 
particular “form,” therefore, represents a highly specific arrangement of 
material components among a much larger set of possible arrangements. 

Understanding form in this way suggests a connection to the notion 
of information in its most theoretically general sense. As I noted in 
Chapter 8, Shannon’s mathematical theory of information equated the 
amount of information transmitted with the amount of uncertainty 
reduced or eliminated in a series of symbols or characters. Information, 
in Shannon’s theory, is thus imparted as some options, or possible 
arrangements, are excluded and others are actualized. The greater the 
number of arrangements excluded, the greater the amount of information 
conveyed. Constraining a set of possible material arrangements, by 
whatever means, involves excluding some options and actualizing 
others. Such a process generates information in the most general sense 
of Shannon’s theory. It follows that the constraints that produce 


biological form also impart information, even if this information is not 
encoded in digital form. 

DNA contains not only Shannon information but also functional or 
specified information. The arrangements of nucleotides in DNA or of 
amino acids in a protein are highly improbable and thus contain large 
amounts of Shannon information. But the function of DNA and proteins 
depends upon extremely specific arrangements of bases and amino acids. 

Similarly, animal body plans represent, not only highly improbable, 
but also highly specific arrangements of matter. Organismal form and 
function depend upon the precise arrangement of various constituents as 
they arise during, or contribute to, embryological development. Thus, 
the specific arrangement of the other building blocks of biological form 
—cells, clusters of similar cell types, dGRNs, tissues, and organs—also 
represent a kind of specified or functional information. 

In Chapter 8, I noted that the ease with which Shannon’s information 
theory applies to molecular biology has sometimes led to confusion 
about the kind of information contained in DNA and proteins. It may 
have also created confusion about the places that specified information 
might reside in organisms. Perhaps because the information-carrying 
capacity of the gene can be so easily measured, biologists have often 
treated DNA, RNA, and proteins as the sole repositories of biological 
information. Neo-Darwinists have assumed that genes possess all the 
information necessary to specify the form of an animal. They have also 
assumed that mutations in genes will suffice to generate the new 
information necessary to build a new form of animal life.? Yet if 
biologists understand organismal form as resulting from constraints on 
the possible arrangements of matter at many levels in the biological 
hierarchy—from genes and proteins, to cell types and tissues, to organs 
and body plans—then biological organisms may well exhibit many 
levels of information-rich structure. Discoveries in developmental 
biology have confirmed this possibility. 


BEYOND GENES 


Many biologists no longer believe that DNA directs virtually everything 
happening within the cell. Developmental biologists, in particular, are 
now discovering more and more ways that crucial information for 
building body plans is imparted by the form and structure of embryonic 
cells, including information from both the unfertilized and fertilized egg. 

Biologists now refer to these sources of information as 
“epigenetic.”!? Spemann and Mangold’s experiment is only one of many 
to suggest that something beyond DNA may be influencing the 
development of animal body plans. Since the 1980s, developmental and 
cell biologists such as Brian Goodwin, Wallace Arthur, Stuart Newman, 
Fred Nijhout, and Harold Franklin have discovered or analyzed many 
sources of epigenetic information. Even molecular biologists such as 
Sidney Brenner, who pioneered the idea that genetic programs direct 
animal development, now insist that the information needed to code for 
complex biological systems vastly outstrips the information in DNA.!! 

DNA helps direct protein synthesis. Parts of the DNA molecule also 
help to regulate the timing and expression of genetic information and the 
synthesis of various proteins within cells. Yet once proteins are 
synthesized, they must be arranged into higher-level systems of proteins 
and structures. Genes and proteins are made from simple building blocks 
—nucleotide bases and amino acids, respectively—arranged in specific 
ways. Similarly, distinctive cell types are made of, among other things, 
systems of specialized proteins. Organs are made of specialized 
arrangements of cell types and tissues. And body plans comprise specific 
arrangements of specialized organs. Yet the properties of individual 
proteins do not fully determine the organization of these higher-level 
structures and patterns.!* Other sources of information must help arrange 
individual proteins into systems of proteins, systems of proteins into 
distinctive cell types, cell types into tissues, and different tissues into 
organs. And different organs and tissues must be arranged to form body 
plans. 


_AN INFORMATION-RICH HIERARCHY 
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The hierarchical layering or arrangement of different sources of 
information. Note that the information necessary to build the lower-level 
electronic components does not determine the arrangement of those 
components on the circuit board or the arrangement of the circuit board 
and the other parts necessary to make a computer. That requires 
additional informational inputs. 


Two analogies may help clarify the point. At a construction site, 
builders will make use of many materials: lumber, wires, nails, drywall, 
piping, and windows. Yet building materials do not determine the floor 
plan of the house or the arrangement of houses in a neighborhood. 
Similarly, electronic circuits are composed of many components, such as 
resistors, capacitors, and transistors. But such lower-level components 
do not determine their own arrangement in an integrated circuit (see Fig. 
14.2). 

In a similar way, DNA does not by itself direct how individual 
proteins are assembled into these larger systems or structures—cell 
types, tissues, organs, and body plans—during animal development. !? 
Instead, the three-dimensional structure or spatial architecture of 
embryonic cells plays important roles in determining body-plan 


formation during embryogenesis. Developmental biologists have 
identified several sources of epigenetic information in these cells. 


CYTOSKELETAL ARRAYS 


Eukaryotic cells have internal skeletons to give them shape and stability. 
These “cytoskeletons” are made of several different kinds of filaments 
including those called the “microtubules.” The structure and location of 
the microtubules in the cytoskeleton influence the patterning and 
development of embryos. Microtubule “arrays” within embryonic cells 
help to distribute essential proteins used during development to specific 
locations in these cells. Once delivered, these proteins perform functions 
critical to development, but they can only do so if they are delivered to 
their correct locations with the help of preexisting, precisely structured 
microtubule or cytoskeletal arrays (see Fig. 14.3). Thus, the precise 
arrangement of microtubules in the cytoskeleton constitutes a form of 
critical structural information. 

These microtubule arrays are made of proteins called tubulin, which 
are gene products. Nevertheless, like bricks that can be used to assemble 
many different structures, the tubulin proteins in the cell’s microtubules 
are identical to one another. Thus, neither the tubulin subunits, nor the 
genes that produce them, account for the differences in the shape of the 
microtubule arrays that distinguish different kinds of embryos and 
developmental pathways. Instead, the structure of the microtubule array 
itself is, once again, determined by the location and arrangement of its 
subunits, not the properties of the subunits themselves. Jonathan Wells 
explains it this way: “What matters in [embryological] development is 
the shape and location of microtubule arrays, and the shape and location 
of a microtubule array is not determined by its units.”!4 For this reason, 
as University of Colorado cell biologist Franklin Harold notes, it is 
impossible to predict the structure of the cytoskeleton of the cell from 
the characteristics of the protein constituents that form that structure. !> 





FIGURE 14.3 

Figure 14.3a (left) shows a still shot from an animation of microtubule 
(at the bottom of the image) made of tubulin proteins. Courtesy Joseph 
Condeelis. Figure 14.3b (right) shows a microscopic image of a large 
section of cytoskeleton made of many microtubules (and other elements) 
inside the cell in cross section. Courtesy The Company of Biologists and 
Journal of Cell Science. 


Another cell structure influences the arrangement of the microtubule 
arrays and thus the precise structures they form and the functions they 
perform. In an animal cell, that structure is called the centrosome 
(literally, “central body”), a microscopic organelle that sits next to the 
nucleus between cell divisions in an undividing cell. Emanating from the 
centrosome is the microtubule array that gives a cell its three- 
dimensional shape and provides internal tracks for the directed transport 
of organelles and essential molecules to and from the nucleus.!° During 
cell division the centrosome duplicates itself. The two centrosomes form 
the poles of the cell-division apparatus, and each daughter cell inherits 
one of the centrosomes; yet the centrosome contains no DNA.!” Though 
centrosomes are made of proteins—gene products—the centrosome 
structure is not determined by genes alone. 


MEMBRANE PATTERNS 


Another important source of epigenetic information resides in the two- 
dimensional patterns of proteins in cell membranes.!® When messenger 
RNAs are transcribed, their protein products must be transported to the 


proper locations in embryonic cells in order to function properly. 
Directed transport involves the cytoskeleton, but it also depends on 
spatially localized targets in the membrane that are in place before 
transport occurs. Developmental biologists have shown that these 
membrane patterns play a crucial role in the embryological development 
of fruit flies. 


Membrane Targets 


For example, early embryo development in the fruit fly Drosophila 
melanogaster requires the regulatory molecules Bicoid and Nanos 
(among others). The former is required for anterior (head) development, 
and the latter is required for posterior (tail) development.!° In the early 
stages of embryological development, nurse cells pump Bicoid and 
Nanos RNAs into the egg. (Nurse cells provide the cell that will become 
the egg—known as the oocyte—and the embryo with maternally 
encoded messenger RNAs and proteins.) Cytoskeletal arrays then 
transport these RNAs through the oocyte, where they become attached to 
specified targets on the inner surface of the egg.”? Once in their proper 
place—but only then—Bicoid and Nanos play critical roles in 
organizing the head-to-tail axis of the developing fruit fly. They do this 
by forming two gradients (or differential concentrations), one with 
Bicoid protein most concentrated at the anterior end and another with 
Nanos protein most concentrated at the posterior end. 

Insofar as both of these molecules are RNAs—that is, gene products 
—genetic information plays an important role in this process. Even so, 
the information contained in the bicoid and nanos genes does not by 
itself ensure the proper function of the RNAs and proteins for which the 
genes code. Instead, preexisting membrane targets, already positioned on 
the inside surface of the egg cell, determine where these molecules will 
attach and how they will function. These membrane targets provide 
crucial information—spatial coordinates—for embryological 
development. 


Ton Channels and Electromagnetic Fields 


Membrane patterns can also provide epigenetic information by the 
precise arrangement of ion channels—openings in the cell wall through 
which charged electrical particles pass in both directions. For example, 
one type of channel uses a pump powered by the energy-rich molecule 
ATP to transport three sodium ions out of the cell for every two 
potassium ions that enter the cell. Since both ions have a charge of plus 
one (Na+, K+), the net difference sets up an electromagnetic field across 
the cell membrane.’! 

Experiments have shown that electromagnetic fields have 
“morphogenetic” effects—in other words, effects that influence the form 
of a developing organism. In particular, some experiments have shown 
that the targeted disturbance of these electric fields disrupts normal 
development in ways that suggest the fields are controlling 
morphogenesis.** Artificially applied electric fields can induce and 
guide cell migration. There is also evidence that direct current can affect 
gene expression, meaning internally generated electric fields can provide 
spatial coordinates that guide embryogenesis.” Although the ion 
channels that generate the fields consist of proteins that may be encoded 
by DNA (just as microtubules consist of subunits encoded by DNA), 
their pattern in the membrane is not. Thus, in addition to the information 
in DNA that encodes morphogenetic proteins, the spatial arrangement 
and distribution of these ion channels influences the development of the 
animal. 


The Sugar Code 


Biologists know of an additional source of epigenetic information stored 
in the arrangement of sugar molecules on the exterior surface of the cell 
membrane. Sugars can be attached to the lipid molecules that make up 
the membrane itself (in which case they are called “glycolipids’”’), or 
they can be attached to the proteins embedded in the membrane (in 
which case they are called “glycoproteins”). Since simple sugars can be 
combined in many more ways than amino acids, which make up 


proteins, the resulting cell surface patterns can be enormously complex. 
As biologist Ronald Schnaar explains, “Each [sugar] building block can 
assume several different positions. It is as if an A could serve as four 
different letters, depending on whether it was standing upright, turned 
upside down, or laid on either of its sides. In fact, seven simple sugars 
can be rearranged to form hundreds of thousands of unique words, most 
of which have no more than five letters.”4 

These sequence-specific information-rich structures influence the 
arrangement of different cell types during embryological development. 
Thus, some cell biologists now refer to the arrangements of sugar 
molecules as the “sugar code” and compare these sequences to the 
digitally encoded information stored in DNA.” As biochemist Hans- 
Joachim Gabius notes, sugars provide a system with “high-density 
coding” that is “essential to allow cells to communicate efficiently and 
swiftly through complex surface interactions.”*° According to Gabius, 
“These [sugar] molecules surpass amino acids and nucleotides by far in 
information-storing capacity.”*’ So the precisely arranged sugar 
molecules on the surface of cells clearly represent another source of 
information independent of that stored in DNA base sequences. 


NEO-DARWINISM AND THE CHALLENGE OF EPIGENETIC 
INFORMATION 


These different sources of epigenetic information in embryonic cells 
pose an enormous challenge to the sufficiency of the neo-Darwinian 
mechanism. According to neo-Darwinism, new information, form, and 
structure arise from natural selection acting on random mutations arising 
at a very low level within the biological hierarchy—within the genetic 
text. Yet both body-plan formation during embryological development 
and major morphological innovation during the history of life depend 
upon a specificity of arrangement at a much higher level of the 
organizational hierarchy, a level that DNA alone does not determine. If 
DNA isn’t wholly responsible for the way an embryo develops—for 
body-plan morphogenesis—then DNA sequences can mutate 


indefinitely and still not produce a new body plan, regardless of the 
amount of time and the number of mutational trials available to the 
evolutionary process. Genetic mutations are simply the wrong tool for 
the job at hand. 

Even in a best-case scenario—one that ignores the immense 
improbability of generating new genes by mutation and selection— 
mutations in DNA sequence would merely produce new genetic 
information. But building a new body plan requires more than just 
genetic information. It requires both genetic and epigenetic information 
—information by definition that is not stored in DNA and thus cannot be 
generated by mutations to the DNA. It follows that the mechanism of 
natural selection acting on random mutations in DNA cannot by itself 
generate novel body plans, such as those that first arose in the Cambrian 
explosion. 


GENE-CENTRIC RESPONSES 


Many of the biological structures that impart important three- 
dimensional spatial information—such as cytoskeletal arrays and 
membrane ion channels—are made of proteins. For this reason, some 
biologists have insisted that the genetic information in DNA that codes 
for these proteins does account for the spatial information in these 
various structures after all. In each case, however, this exclusively 
“gene-centric” view of the location of biological information—and the 
origin of biological form—has proven inadequate. 

First, in at least the case of the sugar molecules on the cell surface, 
gene products play no direct role. Genetic information produces proteins 
and RNA molecules, not sugars and carbohydrates. Of course, important 
glycoproteins and glycolipids (sugar-protein and sugar-fat composite 
molecules) are modified as the result of biosynthetic pathways involving 
networks of proteins. Nevertheless, the genetic information that 
generates the proteins in these pathways only determines the function 
and structure of the individual proteins; it does not specify the 


coordinated interaction between the proteins in the pathways that result 
in the modification of sugars.’ 

More important, the location of specific sugar molecules on the 
exterior surface of embryonic cells plays a critical role in the function 
that these sugar molecules play in intercellular communication and 
arrangement. Yet their location is not determined by the genes that code 
for the proteins to which these sugar molecules might be attached. 
Instead, research suggests that protein patterns in the cell membrane are 
transmitted directly from parent membrane to daughter membrane 
during cell division rather than as a result of gene expression in each 
new generation of cells.2? Since the sugar molecules on the exterior of 
the cell membrane are attached to proteins and lipids, it follows that their 
position and arrangement probably result from membrane-to-membrane 
transmission as well. 

Consider next the membrane targets that play a crucial role in 
embryological development by attracting morphogenetic molecules to 
specific places on the inner surface of the cell. These membrane targets 
consist largely of proteins, most of which are mainly specified by DNA. 
Even so, many “intrinsically disordered”? proteins fold differently 
depending on the surrounding cellular context. This context thus 
provides epigenetic information. Further, many membrane targets 
include more than one protein, and these multiprotein structures do not 
automatically self-organize to form properly structured targets.*! Finally, 
it is not only the molecular structure of these membrane targets, but also 
their specific /ocation and distribution that determines their function. Yet 
the location of these targets on the inner surface of the cell is not 
determined by the gene products out of which they are made any more 
than, for example, the locations of the bridges across the River Seine in 
Paris are determined by the properties of the stones out of which they are 
made. 

Similarly, the sodium-potassium ion pumps in cell membranes are 
indeed made of proteins. Nevertheless, it is, again, the location and 
distribution of those channels and pumps in the cell membrane that 
establish the contours of the electromagnetic field that, in turn, influence 


embryological development. The protein constituents of these channels 
do not determine where the ion channels are located. 

Like membrane targets and ion channels, microtubules are also made 
of many protein subunits, themselves undeniably the products of genetic 
information. In the case of microtubule arrays, defenders of the gene- 
centric view do not claim that individual tubulin proteins determine the 
structure of these arrays. Nevertheless, some have suggested that other 
proteins, or suites of proteins, acting in concert could determine such 
higher-level form. For example, some biologists have noted that so- 
called helper proteins—which are gene products—called “microtubule 
associated proteins” (MAPs) help to assemble the tubulin subunits in the 
microtubule arrays. 

Yet MAPs, and indeed many other necessary proteins, are only part 
of the story. The locations of specified target sites on the interior of the 
cell membrane also help to determine the shape of the cytoskeleton. 
And, as noted, the gene products out of which these targets are made do 
not determine the location of these targets. Similarly, the position and 
structure of the centrosome—the microtubule-organizing center—also 
influences the structure of the cytoskeleton. Although centrosomes are 
made of proteins, the proteins that form these structures do not entirely 
determine their location and form. As Mark McNiven, a molecular 
biologist at the Mayo Clinic, and cell biologist Keith Porter, formerly of 
the University of Colorado, have shown, centrosome structure and 
membrane patterns as a whole convey three-dimensional structural 
information that helps determine the structure of the cytoskeleton and 
the location of its subunits.>* Moreover, as several other biologists have 
shown, the centrioles that compose the centrosomes replicate 
independently of DNA replication: daughter centrioles receive their 
form from the overall structure of the mother centriole, not from the 
individual gene products that constitute them. 

Additional evidence of this kind comes from ciliates, large single- 
celled eukaryotic organisms. Biologists have shown that microsurgery 
on the cell membranes of ciliates can produce heritable changes in 
membrane patterns without altering the DNA.** This suggests that 


membrane patterns (as opposed to membrane constituents) are 
impressed directly on daughter cells. In both cases—in membrane 
patterns and centrosomes—form is transmitted from parent three- 
dimensional structures to daughter three-dimensional structures directly. 
It is not entirely contained in DNA sequences or the proteins for which 
these sequences code.’ 

Instead, in each new generation, the form and structure of the cell 
arises as the result of both gene products and the preexisting three- 
dimensional structure and organization inherent in cells, cell membranes, 
and cyto-skeletons. Many cellular structures are built from proteins, but 
proteins find their way to correct locations in part because of preexisting 
three-dimensional patterns and organization inherent in cellular 
structures. Neither structural proteins nor the genes that code for them 
can alone determine the three-dimensional shape and structure of the 
entities they build. Gene products provide necessary, but not sufficient, 
conditions for the development of three-dimensional structure within 
cells, organs, and body plans.*° If this is so, then natural selection acting 
on genetic variation and mutations alone cannot produce the new forms 
that arise in the history of life. 


EPIGENETIC MUTATIONS 


When I explain this in public talks, I can count on getting the same 
question. Someone in the audience will ask whether mutations could 
alter the structures in which epigenetic information resides. The 
questioner wonders if changes in epigenetic information could supply 
the variation and innovation that natural selection needs to generate new 
form, in much the same way that neo-Darwinists envision genetic 
mutations doing so. It’s a reasonable thing to ask, but it turns out that 
mutating epigenetic information doesn’t offer a realistic way of 
generating new forms of life. 

First, the structures in which epigenetic information inheres— 
cytoskeletal arrays and membrane patterns, for example—are much 
larger than individual nucleotide bases or even stretches of DNA. For 


this reason, these structures are not vulnerable to alteration by many of 
the typical sources of mutation that affect genes such as radiation and 
chemical agent. 

Second, to the extent that cell structures can be altered, these 
alterations are overwhelmingly likely to have harmful or catastrophic 
consequences. The original Spemann and Mangold experiment did, of 
course, involve forcibly altering an important repository of epigenetic 
information in a developing embryo. Yet the resulting embryo, though 
interesting and illustrative of the importance of epigenetic information, 
did not stand a chance of surviving in the wild, let alone reproducing. 

Altering the cell structures in which epigenetic information inheres 
will likely result in embryo death or sterile offspring—for much the 
same reason that mutating regulatory genes or developmental gene 
regulatory networks also produces evolutionary dead ends. The 
epigenetic information provided by various cell structures is critical to 
body-plan development, and many aspects of embryological 
development depend upon the precise three-dimensional placement and 
location of these information-rich cell structures. For example, the 
specific function of morphogenetic proteins, the regulatory proteins 
produced by master regulatory (Hox) genes, and developmental gene 
regulatory networks (dGRNs) all depend upon the location of specific, 
information-rich, and preexisting cell structures. For this reason, altering 
these cell structures will in all likelihood damage something else crucial 
during the developmental trajectory of the organism. Too many different 
entities involved in development depend for their proper function upon 
epigenetic information for such changes to have a beneficial or even 
neutral effect. 

In Chapter 16 I will examine several new theories of evolution, 
including one known as “epigenetic inheritance.” We’ll see that there are 
some additional difficulties associated with the idea that mutations in 
epigenetic structures can produce significant evolutionary innovation. 


DARWIN’S GROWING ANOMALY 


With the publication of On the Origin of Species in 1859, Darwin 
advanced, first and foremost, an explanation for the origin of biological 
form. At the time, he acknowledged that the pattern of appearance of the 
Cambrian animals did not conform to his gradualist picture of the history 
of life. Thus, he regarded the Cambrian explosion as primarily a problem 
of incompleteness in the fossil record. 

In Chapters 2, 3, and 4, I explained why the problem of fossil 
discontinuity exemplified by the Cambrian forms has, since Darwin’s 
time, only intensified. Yet clearly a more fundamental problem now 
afflicts the whole edifice of modern neo-Darwinian theory. The neo- 
Darwinian mechanism does not account for either the origin of the 
genetic or the epigenetic information necessary to produce new forms of 
life. Consequently, the problems posed to the theory by the Cambrian 
explosion remain unsolved. But further, the central problem that Darwin 
set out to answer in 1859, namely the origin of animal form in general, 
remains unanswered—as Muller and Newman in particular have 
noted.>’ 

Contemporary critics of neo-Darwinism acknowledge, of course, that 
preexisting forms of life can diversify under the twin influences of 
natural selection and genetic mutation. Known microevolutionary 
processes can account for small changes in the coloring of peppered 
moths, the acquisition of antibiotic resistance in different strains of 
bacteria, and cyclical variations in the size of Galapagos finch beaks. 
Nevertheless, many biologists now argue that neo-Darwinian theory 
does not provide an adequate explanation for the origin of new body 
plans or events such as the Cambrian explosion. 

For example, evolutionary biologist Keith Stewart Thomson, 
formerly of Yale University, has expressed doubt that large-scale 
morphological changes could accumulate by minor changes at the 
genetic level.’ Geneticist George Miklos, of the Australian National 
University, has argued that neo-Darwinism fails to provide a mechanism 
that can produce large-scale innovations in form and structure.?? 
Biologists Scott Gilbert, John Opitz, and Rudolf Raff have attempted to 
develop a new theory of evolution to supplement classical neo- 


Darwinism, which, they argue, cannot adequately explain large-scale 
macroevolutionary change. As they note: 


Starting in the 197 0s, many biologists began questioning its [neo- 
Darwinism s ] adequacy in explaining evolution. Genetics might be 
adequate for explaining microevolution, but microevolutionary 
changes in gene frequency were not seen as able to turn a reptile 
into amammal or to convert a fish into an amphibian. 
Microevolution looks at adaptations that concern the survival of 
the fittest, not the arrival of the fittest. As Goodwin (1995) points 
out, “the origin of species—Darwin’s problem—remains 


unsolved.” 4? 


Gilbert and his colleagues have tried to solve the problem of the 
origin of form by invoking mutations in genes called Hox genes, which 
regulate the expression of other genes involved in animal development 
—an approach that I will examine in Chapter 16.4! Notwithstanding, 
many leading biologists and paleontologists—Gerry Webster and Brian 
Goodwin, Ginter Theissen, Mare Kirschner, and John Gerhart, Jeffrey 
Schwartz, Douglas Erwin, Eric Davidson, Eugene Koonin, Simon 
Conway Morris, Robert Carroll, Gunter Wagner, Heinz-Albert Becker 
and Wolf-Eckhart Lonnig, Stuart Newman and Gerd Müller, Stuart 
Kauffman, Peter Stadler, Heinz Saedler, James Valentine, Giuseppe 
Sermonti, James Shapiro and Michael Lynch, to name several—have 
raised questions about the adequacy of the standard neo-Darwinian 
mechanism, and/or the problem of evolutionary novelty in particular.47 
For this reason, the Cambrian explosion now looks less like the minor 
anomaly that Darwin perceived it to be, and more like a profound 
enigma, one that exemplifies a fundamental and as yet unsolved problem 
—the origination of animal form. 


Part Three 


After Darwin, What? 
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The Post-Darwinian World and Self- 
Organization 


The year 2009 marked the 150th anniversary of the publication of The 
Origin of Species. In that year, the renowned Cambrian paleontologist 
Simon Conway Morris published an essay in the journal Current Biology 
titled “Walcott, the Burgess Shale and rumours of a post-Darwinian 
world,” assessing the current state of evolutionary biology. “Everywhere 
elsewhere in the Origin the arguments slide one by one skillfully into 
place, the towering edifice rises, and the creationists are left permanently 
in its shadow,” he wrote. “But not when it comes to the seemingly abrupt 
appearance of animal fossils.”! Instead, unresolved problems exposed by 
the Cambrian explosion have, in Conway Morris’s view, “opened the 
way to a post-Darwinian world.”* The evidence we reviewed in the 
previous sections of the book—evidence for a real, rather than merely 
apparent, explosion of animal form in the fossil record, and against the 
neo-Darwinian mechanism as an explanation for the origin of form and 
information—may help to explain why biology has begun to enter such a 
world. 

Moreover, any doubts that at least some biologists have begun to 
embrace a post-Darwinian perspective should have been laid to rest in 
the summer of 2008, when sixteen influential evolutionary biologists 
met for a private conference at the Konrad Lorenz Institute in Altenberg, 
Austria. The scientists, whom the science media later dubbed the 
“Altenberg 16,” met to explore the future of evolutionary theory. These 
biologists had many different, and sometimes conflicting, ideas about 


how new forms of life might have evolved. But all were united by the 
conviction that the neo-Darwinian synthesis had run its course and that 
new evolutionary mechanisms were needed to explain the origin of 
biological form. As paleontologist Graham Budd, who was in 
attendance, explained, “When the public thinks about evolution, they 
think about [things like] the origin of wings. . . . But these are things that 
evolutionary theory has told us little about.” 

Of course, explaining the origin of form is precisely what has made 
the Cambrian explosion so mysterious. In Chapter 7, in discussing the 
idea of punctuated equilibrium, I quoted Cambrian paleontologists 
James Valentine and Douglas Erwin, who concluded exactly that. They 
argued that neither punctuated equilibrium nor neo-Darwinism has 
accounted for the origin of new body plans and that, consequently, 
biology needs a new theory to explain “the evolution of novelty.” 

The Altenberg 16 sought to address this challenge. Since the 
conference, and for nearly two decades preceding it, many evolutionary 
biologists have been working to formulate new theories of evolution, or 
at least new ideas about evolutionary mechanisms with more creative 
power than mutation and natural selection alone. Each of these new 
theories attempts to answer the increasingly urgent question: After 
Darwin—or neo-Darwinism—what? 


THE NEO-DARWINIAN TRIAD 


The neo-Darwinian mechanism rests on three core claims: first, that 
evolutionary change occurs as the result of random, minute variations 
(or mutations); second, that the process of natural selection sifts among 
those variations and mutations, such that some organisms leave more 
offspring than others (differential reproduction) based on the presence or 
absence of certain variations; and third, favored variations must be 
inherited faithfully in subsequent generations of organisms, thus causing 
the population in which they reside to change or evolve over time.° 
Biologists Marc Kirschner and John Gerhart call these three elements— 


variation, natural selection, and heritability—the “three pillars” of neo- 
Darwinian evolution.’ 

Those evolutionary biologists who now doubt orthodox neo- 
Darwinian theory typically question or reject one or more of the 
elements of this neo-Darwinian triad (see Fig. 16.1). Eldredge and Gould 
questioned Darwinian gradualism, which led them to reject the idea that 
mutational change occurs in minute increments (i.e., the first element of 
the neo-Darwinian triad just mentioned). Other evolutionary biologists 
have since rejected other core elements of the mneo-Darwinian 
mechanism and sought to replace them with other mechanisms or 
processes. This chapter will examine a new class of post-neo-Darwinian 
evolutionary models that attempt to explain the origin of biological form 
by deemphasizing the role of random mutations. These models instead 
emphasize the importance of “self-organizational” laws or processes to 
the evolution of biological form. 


SELF-ORGANIZATIONAL MODELS 


Well before the Altenberg 16 convened, a significant number of 
evolutionary theorists had already begun to explore alternatives to the 
neo-Darwinian synthesis. Punctuated equilibrium was one such 
alternative. But as scientific criticisms of that theory began to mount 
during the 1980s and 1990s, a group of scientists associated with a think 
tank in New Mexico, the Santa Fe Institute, developed a new theoretical 
approach. They called it “self-organization.” 

Whereas neo-Darwinism explains the origin of biological form and 
structure as the consequence of natural selection acting on random 
mutations, self-organizational theorists suggest that biological form 
often arises (or “self-organizes”) spontaneously as a consequence of the 
laws of nature (or “laws of form”). Natural selection, they theorize, acts 
to preserve this spontaneously arising order. They think spontaneous 
self-organizing order, not random genetic mutations, typically provides 
the ultimate source of new biological form. Thus, they deemphasize two 


of the three parts of the classical neo-Darwinian triad: random mutations 
and, to a lesser extent, natural selection. 

In 1993, the most prominent scientist associated with the Santa Fe 
Institute, former University of Pennsylvania biochemist Stuart Kauffman 
(see Fig. 15.1), released an eagerly awaited treatise, The Origins of 
Order: Self-Organization and Selection in Evolution. Kauffman 
articulated a trenchant critique of the creative power of the mutation and 
selection mechanism, emphasizing some of the criticisms described here 
in previous chapters. Kauffman advanced a comprehensive alternative 
theory to account for the emergence of new form. In addition, he 
advanced a specific proposal for explaining the Cambrian explosion.’ 
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FIGURE 15.1 
Stuart Kauffman. Courtesy Wikimedia Commons, user Teemu Rajala. 


Kauffman notes that the development of animal body plans involves 
two phases: cell differentiation and body-plan morphogenesis (cell 
organization). He explores the possibility that self-organizational 
processes at work today—specifically in cell differentiation and body- 
plan formation—might help explain how new animal forms originated in 
the past. 

Kauffman proposes, first, that gene regulatory networks in animal 
cells—genes that regulate other genes—influence cell differentiation. 
They do this by generating predictable “pathways of differentiation,” 1° 
patterns by which one type of cell will emerge from another over the 
course of embryological development as cells divide. For example, early 
in embryological development, one type of cell (call it cell type “A”) 
will divide and give rise to two other types of cells (call them types “B” 
and “C”’), which will eventually generate cell types “D” and “E,” and “F 
and G,” respectively, and many other cell types as the process continues. 
Kauffman suggests that these pathways of differentiation “may reflect 
self-organizing features of complex genomic regulatory networks.”!! In 
other words, networks of regulatory genes in embryonic cells determine 
the pathways by which cells divide and differentiate. Since these 
patterns of cell differentiation may be determined by regulatory genes, 
Kauffman regards them as the inevitable by-products of self- 
organizational processes. Moreover, since “pathways of cell 
differentiation [have been] present in all multicellular organisms 
presumably since the Precambrian,”!* he suggests that self-ordering 
properties “inherent in a wide class of genomic regulatory networks” 13 
played a significant role in the origin of the animal forms. 

Kauffman makes a similar case for the importance of self- 
organizational processes during body-plan morphogenesis, the second 
phase of animal development. This phase involves not so much the 
differentiation of one cell type from another, but the arrangement and 


organization of different cell types into the distinct tissues and organs 
that jointly constitute various animal body plans. 

Kauffman again points to known processes of body-plan 
development and suggests that they could have played an important role 
in the formation of the first animal body plans. He cites the importance 
of the structural or “positional”!'* information in cells and cell 
membranes as the crucial determinants of how different cell types are 
organized into different animal forms. I discussed the importance of 
such “epigenetic” information to animal development in Chapter 14 and 
explained why it poses a problem for neo-Darwinian theory. By 
recognizing the importance of such information, Kauffman also rejects 
the neo-Darwinian assumption that a “genetic program” entirely 
determines animal development. He further regards the patterns of 
development that result from this positional information as evidence of 
self-ordering tendencies in matter and the existence of laws of biological 
form. 

Do these self-ordering tendencies or laws of form, if they exist, 
explain the origin of animal body plans and the information necessary to 
build them? They don’t. 


Sel fOrganization and Epigenetic In formation 


To see why let’s look first at how Kauffman attempts to explain the 
epigenetic “positional” information that directs the organization of cells 
in the second phase of animal development. Kauffman attempts to 
explain this “positional” information by offering an entirely hypothetical 
and, ultimately, question-begging proposal. He invokes an idea sketched 
out in the 1940s by the famous English mathematician Alan Turing.!> 
Turing proposed that specific arrangements of cells in animal 
development might ultimately derive from the diffusion and specific 
arrangement of crucial molecules—presumably something like the 
morphogen proteins present in embryonic cells. (Recall that 
morphogens, or morphogen proteins, influence cell differentiation and 
organization during animal development.) Rather than attributing the 


distribution of these morphogenic proteins to preexisting genetic and 
epigenetic information in cells, as occurs during development in modern 
animals, Turing postulated that the distribution of these molecules might 
have originated in the first place independently of such information as 
the result of simple chemical reactions. He imagined one molecule 
producing both a copy of itself (“autocatalyzing’”) and, in addition, 
producing a different molecule as well. Then he envisioned one of these 
molecules inhibiting the production of the other, thus allowing, through 
repeated cycles, the production of more and more of one molecule and 
less and less of the other. Turing thought the resulting nonuniformities in 
the patterns of distribution of those molecules would eventually result in 
nonuniform patterns in the distributions of different cells, possibly 
resulting in different animal forms. 

Kauffman expanded upon this proposal as a way of understanding 
how crucial positional information might organize as the result of 
chemical interactions of different molecules. Nevertheless, his proposal 
suffers from an obvious difficulty: it lacks any chemical or biological 
specificity. In explaining the proposal, Kauffman does not mention any 
specific chemicals or proteins that would behave in the way he 
envisions. Instead, he describes the behavior of hypothetical molecules 
that he labels with the indistinct monikers “X” and “Y.” More important, 
Kauffman offers no evidence that chemicals interacting in the way he 
envisions could create specific biologically relevant configurations or 
distributions of morphogen proteins—apart, that is, from the processes 
that generate specifically arranged distributions of these proteins in 
preexisting information-rich embryonic cells today. 

Instead, it is inherently implausible to think that the specificity 
necessary to coordinate the movements and arrangements of the billions 
or trillions of cells present in adult animal forms could be established by 
the interactions of one or two simple chemicals, even if they formed 
autocatalytic cycles. Kauffman himself seems tacitly to acknowledge the 
difficulty of generating biological specificity from the reactions of 
chemicals alone. He notes, in critique of his own model, that patterns of 
molecular diffusion produced by chemical autocatalysis would depend 


crucially upon “the initial conditions.”!© In other words, getting a 
biologically relevant information-rich arrangement of morphogenic 
proteins would require starting with a very specific (presumably 
information-rich) arrangement of autocatalyzing molecules. 

Kauffman encounters this same problem in attempting to explain the 
origin of the first life as the result of autocatalytic reactions starting from 
a prebiotic soup. In The Origins of Order, he acknowledges that 
generating an autocatalytic, or self-reproducing, set of molecules—a 
crucial step in his origin-of-life scenario—would require “high 
molecular specificity!’ in the initial set of peptides or RNA molecules. 
In other words, it would require specificity of arrangement and 
structure, that is to say, functional information. 


Sel fOrganization and Genetic In formation 


And what about the specifically genetic information necessary to the 
earlier phase of animal development? Does Kauffman’s self- 
organizational theory explain the origin of the “genetic regulatory 
networks” necessary to cell differentiation? Again, it does not. Instead, 
in an even more obvious way, it begs the question of the origin of these 
regulatory networks. Indeed, though Kauffman discusses cell 
differentiation as a kind of “self-ordering” or self-organizational process, 
he acknowledges that the predictable pathways of differentiation that 
characterize this process derive from preexisting gene regulatory 
networks. As Kauffman notes, the spontaneous ordering tendencies in 
cell differentiation are “inherent in a wide class of genomic regulatory 
networks.”!® Indeed, the genetic information in the gene regulatory 
networks does not come from  self-ordering processes of cell 
differentiation. Instead, cell differentiation, to the extent that it can be 
properly described as “self-ordering,” results from preexisting genetic 
sources of information. Thus, the self-organizational process that 
Kauffman cites cannot explain the origin of genetic information, because 
it derives from it, as Kauffman’s own description reveals. 


In a later book, At Home in the Universe: The Search for the Laws of 
Self-Organization and Complexity, Kauffman does offer computer 
simulations of two “model systems”!? that seek to explain, at least in 
principle, how genetic information might have self-organized. In one 
example, he describes a system of buttons connected by strings.?? The 
buttons represent novel genes or proteins and the strings represent self- 
organizational forces of attraction between the proteins. Kauffman 
suggests that when the complexity of this system reaches a critical 
threshold (as represented by the number of buttons and strings), new 
modes of organization might arise in the system “for free”?!—without 
intelligent guidance—similar to the way that water spontaneously 
changes to ice or vapor under specific conditions. 

Kauffman asks his readers to imagine a system of many 
interconnected lights. Each light can flash in a variety of states—on, off, 
twinkling, and so on. Since each light can adopt more than one possible 
state, the system may adopt a vast number of possible states. Further, in 
his system, rules determine how past states influence future states. 
Kauffman asserts that, as a result of these rules, the system would, if 
properly tuned, eventually produce a kind of order in which a few basic 
patterns of light activity recur with greater than random frequency. Since 
these patterns represent a small portion of the total number of possible 
states in which the system can reside, Kauffman suggests that self- 
organizational laws might similarly find highly improbable biological 
outcomes—perhaps even functional sequences of bases or amino acids 
within a much larger sequence space of possibilities.” 

It’s not hard to see why these simulations also would fail to account 
for the origin of the new genes and proteins needed to produce the 
Cambrian animals. In both of his examples Kauffman presupposes 
significant sources of preexisting information. In his buttons and strings 
simulation, he intends the buttons to represent proteins, themselves the 
result of preexisting genetic information. Where did that information 
come from? Kauffman doesn’t say, but it is an essential part of what 
needs explanation in the history of life. Similarly, in his light system, the 
order that allegedly arises “for free’”—that is, apart from an intelligent 





input of information—only does so if, as Kauffman acknowledges, the 
programmer “tunes” the system to keep it from either (a) generating an 
excessively rigid order or (b) devolving into chaos.” This tuning 
presumably involves an intelligent programmer selecting certain 
parameters and excluding others—that is, inputting information. In fact, 
in summarizing the import of this illustration, Kauffman insists that it 
shows how the “orderliness of the cell, long attributed to the honing of 
Darwinian evolution, seems instead likely to arise from the dynamics of 
the genome network,‘ that is, from preexisting—unexplained—sources 
of genetic information. 

In addition, Kauffman’s model systems are not analogous to 
biological systems because they are not constrained by functional 
considerations. A system of interconnected lights governed by 
preprogrammed rules may well settle into a small number of patterns 
within a much larger space of possibilities. But since these patterns have 
no function and need not meet any functional requirements, they have no 
specificity analogous to that in the genes of actual organisms. 
Kauffman’s model systems do not produce sequences or systems 
characterized by specified complexity or functional information. They 
produce modules of repetitive order distributed in an aperiodic manner, 
yielding mere complexity (i.e., information only in the Shannon 
sense).?> Getting a law-governed system to generate repetitive patterns 
of flashing lights, even with a certain amount of variation, is interesting, 
but not biologically relevant. A system of lights flashing “Vote for 
Jones,” on the other hand, would model a biologically relevant outcome, 
at least, if such a functional sequence of letters arose without intelligent 
agents programming the system with equivalent amounts of functionally 
specified information. 


Kau ffman on the Cambrian 


Kauffman also proposes a specific self-organizational mechanism to 
explain some aspects of the Cambrian explosion. According to 
Kauffman, new Cambrian animals emerged through “long-jump” 


mutations that established new body plans in a discrete rather than 
gradual fashion.2° He recognizes that mutations affecting early 
development are almost inevitably harmful.?’ Thus he concludes that 
body plans, once established, will not change, whatever subsequent 
evolution may occur. This keeps his proposal consistent with a top-down 
pattern in the fossil record in which higher taxa (and the body plans they 
represent) appear first, only later to be followed by the multiplication of 
lower taxa representing variations within those original body designs. 

Even so, Kauffman’s proposal begs the most important question: 
What produces the new Cambrian body plans in the first place? By 
invoking “long-jump mutations,’ he identifies no specific self- 
organizational process that can produce such changes. Moreover, he 
concedes a principle that undermines his own proposal. As noted above, 
Kauffman acknowledges that mutations early in development are almost 
inevitably deleterious. Yet developmental biologists know that these are 
the only kind of mutations that have a realistic chance of producing 
large-scale evolutionary change—the big jumps that Kauffman invokes. 
Though Kauffman repudiates the neo-Darwinian reliance upon random 
mutations, he must invoke the most implausible kind of random 
mutation to provide a self-organizational account of the new Cambrian 
body plans. 


DEVELOPMENTAL TOOLKITS AND SELF- 
ORGANIZATIONAL PROCESSES 


More recently, another advocate of self-organization, Stuart Newman, a 
cell biologist at the New York Medical College, has published several 
papers suggesting that self-organizational processes can help explain the 
origin of body plans. In a paper in the volume produced from the 
Altenberg 16 conference, Newman develops a model that resembles 
Kauffman’s, but one that offers more biological specificity.7° 

Newman, like Kauffman, invokes self-organizational processes. But 
Newman sees these processes acting dynamically and in coordination 
with a genetic “toolkit.” His model emphasizes the importance of a 


highly conserved (1.e., similar) set of regulatory genes in all the major 
Cambrian taxa. In his view, this common “developmental genetic 
toolkit””? has been used “to generate animal body plans and organ forms 
for more than half a billion years”?? since the inception of the animal 
kingdom. 

But if all the animal taxa have the same toolkit, why are the various 
forms of animals and higher metazoan taxa so different from one 
another? For Newman, the answer to this question requires 
understanding how self-organizing processes influence the interaction of 
cells during development and how they cause genes to acquire different 
functions affecting the interactions of cells. 

For example, he attributes the emergence of multicellularity to cells 
acquiring the capacity “to remain attached to one another after 
dividing.”>! This capacity in turn derives not from generating new genes 
and proteins (as neo-Darwinism would assume). Instead, it derives from 
the repurposing of old genes and proteins in response to specific self- 
organizational (and epigenetic) processes such as the “physical force of 
adhesion.”>* Newman proposes, further, that once the first multicellular 
organisms had arisen, they would have “set the stage for additional 
physical processes to come into play’”**—processes that could alter the 
expression and function of still other genes in the developmental genetic 
toolkit, resulting in wholly new and different body plans. As Newman 
explains, “The phenomenon of multicellularity opened up possibilities 
for these molecules to become involved in the molding of bodies and 
organs.”>+ 

Newman envisions new animal body plans resulting from different 
cells sticking to each other in different configurations because of 
different forces of attraction between molecules on the surface of cells 
and because of different patterns of distribution of crucial molecules 
within cells. He calls these self-organizational forces and factors 
“dynamical patterning modules” (or DPMs).°> Figure 15.2 shows some 
typical ways that cells cluster together or arrange themselves as the 
result of these self-organizational forces. Newman lists many 
“dynamical patterning modules,” or self-organizational forces, 


responsible for the spontaneous emergence of these different cell 
clusters, including “adhesion, shape and surface polarization, switching 
between alternative biochemical states, biochemical oscillation, and the 
secretion of diffusible and nondiffusible factors.’°° 


| DYNAMICAL PATTERNING MODULES (DPMs) E 


DPM NAME Z  WHATIT DOES WHAT IT LOOKS LIKE 
ADH CAUSES ADHESTON AMONG A GROUP OF 
(ADHESTON) UNAGGREGATED CELLS, ALLOWING FOR 
MULTICELLULARITY. 
LAT TAKES A GROUP OF AGGREGATED CELLS AND 
(LATERAL ALLOWS CO-EXISTENCE OF ALTERNATIVE 
INHIBITION) CELL STATES WITHIN THE GROUP. 
DAD TAKES AN AGGREGATED GROUP OF DIFFERENT 
(DIFFERENTIAL TYPES OF CELLS AND ALLOWS SEPARATION OF 
ADHESTON) CELLS INTO MULTILAYERED TISSUES. 
POL, 
(POLARITY. TAKES A GROUP OF AGGREGATED CELLS AND 
APICAL BAS. AL) CAUSES FORMATION OF INTERIOR CAVITIES. 
POL, TAKES AN AGGREGATED GROUP OF CELLS 
(POLARITY, AND CAUSES ELONGATION OF TISSUES 
PLANAR) WITHIN A PLANE. 
CSc TAKES AN ELONGATED TISSUE AND CHEMICALLY 
(CHEMICAL INDUCES OSCILLATION OF CELL PATTERNS, 





OSCILLATION) ALLOWING SEGMENTATION OF A BODY PLAN. 


FIGURE 15.2 

Dynamical Patterning Modules (DPMs), showing the different ways 
that, according to biologist Stuart Newman, cells can stick to one 
another (“aggregate”) and form structures during animal development. 


To get a handle on what Newman has in mind, think of cells as Lego 
blocks. There are many different ways of connecting Lego blocks, 
depending on the shape of the blocks and the pattern of raised bumps 
and indentations on the blocks. These patterns allow small groups of 
Legos to be arranged into different modular structures: cubes, walls, 
circular rings, and so forth. Each of these smaller modular structures can 
then be combined to make many different larger structures, from 
airplanes and skyscrapers to submarines and castles. In a similar way, 
Newman suggests that different forces of adhesion between cells and 


different patterns of molecular diffusion within and between cells will 
generate many different patterns or motifs of multicellular organization, 
which in turn function as modular elements that can be combined in 
various ways to make diverse animal forms.?” 

Do these self-organizational processes account for the origin of 
animal body plans in the Cambrian explosion or the information 
necessary to produce new animal forms? Again, they do not. Instead, 
Newman, like Kauffman, either fails to offer an adequate mechanism for 
generating crucial sources of biological information, or he begs the 
question by presupposing the existence of various sources of 
information. 


ASSUME A TOOLKIT 


In the first place, Newman obviously presupposes the existence of a 
“developmental genetic toolkit,” that is, a whole set of genes, including 
regulatory genes, that help to direct the development of animal body 
plans. Where does this genetic information come from? He doesn’t 
specify, though presumably he might be assuming the neo-Darwinian 
mechanism somehow produced the genetic information in the toolkit. If 
so, he leaves his model vulnerable to the criticisms outlined in Chapters 
9 through 12. He certainly does not cite any specifically self- 
organizational process to explain the origin of the genetic toolkit. He 
also incorrectly seems to presuppose that the genes present in this 
common toolkit provide all the genetic information necessary to specify 
individual body plans. But this overlooks a host of recent findings 
showing that individual species within specific taxa often require genes 
for development that are specific to those species and taxa.*® Thus, these 
genes would not have been present in a common metazoan toolkit of the 
kind Newman postulates. 

Second, Newman does not account for the origin of the information 
necessary to organize modular arrangements or groups of cells into 
whole animal body plans. The forces at work in his dynamical patterning 
modules explain, at best, only the arrangements of small groups of cells, 


not the arrangements of those modular cell clusters into tissues, organs, 
and whole body plans. 

Think, again, of arranging Lego blocks. There are many ways of 
arranging small numbers of Lego blocks. These various arrangements 
form common structural motifs such as: two blocks stuck together at 
right angles; several curved blocks forming circular rings, stacked 
blocks forming hollow squares or walls or cube-like shapes; blocks 
arranged as prisms or cylinders; flat layers of blocks stacked two bumps 
thick or three bumps thick or more. Though these structural elements 
stick together because of interactions between the bumps and 
indentations on each block, those bumps and indentations themselves do 
not specify any particular larger structure—a castle or an airplane, for 
example—because each motif may be combined or recombined with 
many other structural motifs in numerous different ways. The shape and 
properties of the modular elements do not dictate the type of larger 
structure that must be built from them. Instead, to build a particular 
structure, the modular elements must be arranged in particular ways. 
And since there are many possible ways to arrange these modular 
elements, only one or a few of which will result in a desired structure, 
every Lego set includes a blueprint with step-by-step instructions—in 
other words, additional information. 

In a similar way, producing a body plan from the different types of 
cell clusters generated by Newman’s dynamical patterning modules 
(DPMs), would also require additional information. Newman does not 
account for this information. He correctly highlights the way certain 
recurrent motifs for organizing groups of cells seem to form 
spontaneously as the result of physical interactions between individual 
cells (his DPMs). He does not, however, establish that these groups of 
cells must arrange themselves into specific tissues, organs, or body plans 
in response to any known physical process or law. Instead, it seems 
entirely possible that these modular elements (cell clusters) have many 
“degrees of freedom” and can be arranged in innumerable ways. If so, 
then some additional information—an overall organismal blueprint or 
set of assembly instructions—would need to direct the arrangement of 


these modular elements. Newman does not consider this possibility. Nor 
does he cite any law-like self-organizational process that would 
eliminate the need for such information to direct animal development. 

There is yet a further problem with Newman’s proposal. Even the 
capacity for cells to self-organize into dynamical patterning modules 
probably derives from prior unexplained sources of information. 
Newman’s DPMs undoubtedly form as the result of interactions between 
molecules on the surface of cells, and as the result of chemical gradients 
between cells—with the specific configuration and properties of those 
molecules determining the exact structure of the individual DPMs. In 
that sense, the DPMs do, of course, self-organize, but clearly the specific 
ways that cells typically cluster together will depend upon highly 
specific and complex forces of interaction between the molecules and 
groups of molecules on the surface of these cells. Many of the molecules 
that contribute to these interactions are no doubt proteins, obvious 
products of genetic information. But, in addition, cell-to-cell interactions 
are affected by the arrangement of proteins and other molecules on the 
surface of cells (such as the sugars in the sugar code, see Chapter 14) as 
well as by the arrangement of structures made of proteins. But these 
molecular arrangements are, in turn, specified by either preexisting 
genetic or, more probably, epigenetic sources of information and 
structure. Thus, Newman’s analysis shows that the self-ordering 
tendencies (or biological laws of form), to the extent they exist, depend 
upon preexisting sources of biological information. Newman, again, 
does not explain where that information comes from. 

Newman does emphasize how epigenetic sources of information 
affect the expression and the function of gene products during the 
process of animal development. He notes that different gene products 
may perform different functions depending upon the organismal context 
in which they find themselves. But Newman does not explain with any 
specificity how the genes in the common toolkit acquire different 
functions in response to self-organizational processes or where the 
epigenetic information comes from that determines those functions. Yet, 


clearly context-dependent gene expression depends upon a host of other 
preexisting epigenetic sources of information and structure. 

Recall, for example, the discussion of cell membrane targets from 
Chapter 14. These targets provide an important source of epigenetic 
information by influencing the positioning of crucial morphogenic 
proteins. Yet, the arrangements of the targets on the cell membrane do 
not self-organize as the result of simple chemical interactions between 
the proteins out of which they are made*”—that is, the proteins do not 
determine the location of the membrane targets on the interior of the 
cell. Instead, the location and the structure of membrane targets are 
transmitted from parent membrane to daughter membrane, a process that 
transmits preexisting epigenetic structural information from the parent 
membrane. Newman does not attempt to explain the origin of this 
information or structure by any known self-organizational process. 
Instead, the evidence indicates that interior and exterior membrane 
targets, cytoskeletal arrays, the sugar code, and many other sources of 
epigenetic structural information do not self-organize as the result of 
physical interactions between their respective molecular subunits.*° 


ORDER VS. INFORMATION 


Self-organizational theorists face, in addition, a conceptual distinction 
that has cast doubt on the relevance of their theories to biological 
systems. Self-organizational theorists seek to explain the origin of 
“order” in living systems by reference to purely physical or chemical 
processes (or laws describing those processes). But what needs to be 
explained in living systems is not mainly order in the sense of simple 
repetitive or geometric patterns. Instead, what requires explanation is the 
adaptive complexity and the information, genetic and epigenetic, 
necessary to build it. 

Yet advocates of self-organization fail to offer examples of either 
biological information or complex anatomical structures arising from 
physics and chemistry alone. They either point, as Newman and 
Kauffman do, to embryological development unfolding predictably as 


the result of preexisting information-rich gene products, cell membranes, 
and other cell structures. Or they offer examples of purely physical and 
chemical processes generating a kind of order that has little relevance to 
the features of living systems that most need explanation. 

In the latter case, self-organizational theorists often point to simple 
geometric shapes or repetitive forms of order arising from or being 
modified by purely physical or chemical processes. They suggest that 
such order provides a model for understanding the origin of biological 
information or body-plan morphogenesis.*! Self-organizational theorists 
have cited crystals, vortices, and convection currents (or stable patterns 
of flashing lights) to illustrate the supposed power of physical processes 
to generate “order for free.” Crystals of salt do form as the result of 
forces of attraction between sodium and chloride ions; vortices can 
result from gravitational and other forces acting on water in a draining 
bathtub; convection currents do emerge from warm air (or molten rock) 
rising in enclosed spaces. And some molecules found in living systems 
do adopt highly ordered structures and recognizable geometric shapes as 
the result of the physical interactions of their constituent parts alone. 
Nevertheless, the type of order evident in these molecules or physical 
systems has nothing to do with the specific “order” of arrangement—the 
information or specified complexity—that characterizes the digital code 
in DNA and other higher-level information-rich biological structures. 

This is easiest to see in the case of the information encoded in DNA 
and RNA. Some of what follows may be familiar from my discussion in 
Chapter 8, but it bears repeating. The bases in the coding region of a 
section of DNA or in an RNA transcript are typically arranged in a 
nonrepetitive or aperiodic way. These sections of genetic text display 
what information scientists call “complexity,” not simple “order” or 
“redundancy.” 

To see the difference between order and complexity consider the 
difference between the following sequences: 


Na-Cl-Na-Cl-Na-Cl-Na-Cl 
AZFRT<MPGRTSHKLKYR 


The first sequence, describing the chemical structure of salt crystals, 
displays what information scientists call “redundancy” or simple 
“order.” That’s because the two constituents, Na and Cl (sodium and 
chloride), are highly ordered in the sense of being arranged in a simple, 
rigidly repetitive way. The sequence on the bottom, by contrast, exhibits 
complexity. In this randomly generated string of characters, there is no 
simple repetitive pattern. Whereas the sequence on the top could be 
generated by a simple rule or computer algorithm, such as “Every time 
Na arises, attach a Cl to it, and vice versa,” no rule shorter than the 
sequence itself could generate the second sequence. 

The information-rich sequences in DNA, RNA, and proteins, by 
contrast, are characterized not by either simple order or mere 
complexity, but instead by “specified complexity.” In such sequences, 
the irregular and unpredictable arrangement of the characters (or 
constituents) is critical to the function that the sequence performs. The 
three sequences below illustrate these distinctions: 


Na-Cl-Na-Cl-Na-Cl-Na-Cl (Order) 
AZFRT<MPGRTSHKLKYR (Complexity) 
Time and tide wait for no man (Specified complexity) 


What does all this have to do with self-organization? Simply this: the 
law-like, self-organizing processes that generate the kind of order 
present in a crystal or a vortex do not also generate complex sequences 
or structures; still less do they generate specified complexity, the kind of 
“order” present in a gene or functionally complex organ. 

Laws of nature by definition describe repetitive phenomena—order in 
that sense—that can be described with differential equations or universal 
“if-then” statements. Consider, for example, these informal expressions 
of the law of gravity: “All unsupported bodies fall” or “If an elevated 
body is left unsuspended, then it will fall.” These statements represent 
reasonably accurate law-like descriptions of natural gravitational 
phenomena precisely because we have repeated experience of 


unsupported bodies falling to the earth. In nature, repetition provides 
grist for lawful description. 

The information-bearing sequences in protein-coding DNA and RNA 
molecules do not exhibit such repetitive “order,” however. As such, 
these sequences can be neither described nor explained by reference to a 
natural law or law-like “self-organizational” process. The kind of non- 
repetitive “order” on display in DNA and RNA—a precise sequential 
“order” necessary to ensure function—is not the kind that laws of nature 
or law-like self-organizational processes can—in principle—generate or 
explain. 

Otherwise, the nucleotide bases would repeat rigidly—such as 
ACACACACACACACACAC—in a way that would not allow DNA to 
store or convey specified information. A curious feature of the chemistry 
of DNA allows any one of the four nucleotide bases to attach to any site 
on the interior backbone of the DNA molecule. This chemical 
indeterminacy makes it possible for DNA and RNA to store any one of a 
virtually unlimited number of different arrangements of nucleotide bases 
—in effect, to encode any genetic message. But this indeterminacy also 
categorically defies explanation by deterministic law-like forces of 
chemical attraction. And because forces of attraction do not determine 
the sequence of nucleotide bases in DNA or RNA, the origin of the 
specific arrangement of the bases—the information in DNA and RNA— 
cannot be attributed to self-organizing forces of attraction either. 

Hubert Yockey, a leading innovator in the application of information 
theory to molecular biology, first recognized the problems associated 
with invoking self-organization to explain the origin of biological 
information. These theories fail, he argued, for two reasons. First, they 
do not distinguish order from information. And, second, the information 
in the DNA molecule does not derive from law-like forces of 
attraction.” As he explained in 1977: “Attempts to relate the idea of 
order . . . with biological organization or specificity must be regarded as 
a play on words that cannot stand careful scrutiny. Informational 
macromolecules can code genetic messages and therefore can carry 


information because the sequence of bases or residues is affected very 
little, if at all, by [self-organizing] physicochemical factors.”*? 

Much the same thing is true of many vital sources of epigenetic 
information. The forces of attraction between constituent proteins in 
membrane targets or cytoskeletal arrays, for example, do not determine 
the structure or location of these epigenetic structures and the positional 
information they provide. The origin of these structures cannot be 
attributed to self-organizing forces of attraction either. Instead, in each 
case, information-rich epigenetic structures are generated from 
preexisting sources of epigenetic information. 

Thus, self-organizational theories explain well what does not mainly 
need explaining in biology, namely, repetitive or simple geometric forms 
of order. Self-organizational theorists do cite structures that might have 
self-organized. But these examples are typically extremely modest in 
scope. They include repeating patterns of atoms in crystals; simple 
geometric figures; patterns of lines, triangles, and streaks; vortices; 
spiral wave currents; and simple shapes that glide across computer 
screens.*+ None exhibit the specified complexity that characterizes the 
digital information in DNA and RNA or the complex arrangements of 
proteins, cells, tissues, and organs necessary to build a functioning form 
of animal life. 


NATURAL MAGIC OR TRUE CAUSE? 


In 2007, I participated in a private meeting of evolutionary biologists 
and other scientists who shared the conviction that a new theory of 
biological origins is now needed. In attendance were several prominent 
advocates of the self-organization approach. During the meeting, these 
scientists presented intriguing analogies from physics and chemistry to 
show how order might have arisen “for free’ —that is, without intelligent 
guidance—in the biological realm. Yet the order they described in these 
analogies seemed to have no direct relevance to the complexity—indeed 
the specified complexity—of genes or cell membranes or animal body 
plans. Other scientists at the conference challenged the advocates of self- 


organization to cite known processes that could produce biologically 
relevant form and information. 

Near the end of the meeting one advocate of self-organization 
privately acknowledged to me the validity of these critiques, admitting 
that, for now, “self-organization is really more of a slogan than a 
theory.” Stuart Kauffman, perhaps attempting to make a virtue of the 
necessity of accepting this explanatory deficit, has recently celebrated 
the self-organizational perspective for embracing what he calls “natural 
magic.” In a lecture at MIT, he concluded: “Life bubbles forth in a 
natural magic beyond the confines of entailing law, beyond 
mathematization.”4> He went on to explain that one benefit of the self- 
organizational perspective is that it allows us to be “reenchanted” with 
nature and to “find a way beyond modernity.’*° 

Since the beginning of modern science, scientists have championed a 
commonsense principle of scientific reasoning known as the vera causa 
principle. This principle holds that explaining a particular phenomenon 
or event requires identifying a “true cause” that is known from 
experience to have the power to produce the event or phenomenon in 
question. The early modern scientists affirmed this principle as one of 
the key aspects of a scientific approach to understanding nature. This 
stood in opposition to the magical thinking that had gone before in 
which people attributed powers to nature that they had never observed it 
manifesting. As the scientific revolution matured, endeavors such as 
alchemy, for example, were eventually rejected precisely because the 
alchemists could not identify a cause that could effect the transformation 
they were seeking to demonstrate. 

Self-organizational theories have clearly failed to provide a vera 
causa for the origin of biologically relevant forms of “order’—the 
functional complexity and specified information present in living 
systems. Instead, they either beg the question as to the ultimate origin of 
biological information or point to physical and chemical processes that 
do not produce the specified complexity that characterizes actual 
animals. 





Viewed in this light, Kauffman’s recent discussion of natural magic 
and calls for a “reenchantment” with nature sound less like a bold new 
initiative to reconcile science and spirituality (which is what he 
intended) than a tacit admission that self-organizational theories have 
failed to identify known physical and chemical processes capable of 
generating the form and information present in actual living systems. 
Indeed, after years of attempting to solve the problem of the origin of 
form, Kauffman’s recent ruminations about “natural magic” sound a lot 
like an admission that a profound mystery remains. 


16 
Other Post-Neo-Darwinian Models 


When Stephen Jay Gould was first wrestling with the question of how 
new forms of animal life could have arisen so quickly in the fossil 
record, he considered many possible mechanisms of change. In the 
famed 1980 paper in which he declared neo-Darwinism “effectively 
dead,”! he didn’t just propose allopatric speciation and species 
selections as new evolutionary mechanisms. He also granted a rehearing 
to a long discredited idea. Specifically, he argued that large-scale 
“macromutations” might generate significant innovations in form 
relatively quickly. 

In the 1930s and 1940s, this idea had been associated with 
University of California at Berkeley geneticist Richard Goldschmidt. 
Aware of the many discontinuities in the fossil record, Goldschmidt 
envisioned radical transformations in the form of animals arising in 
even one generation as the result of such large-scale mutations. He 
endorsed, for instance, the view of the German paleontologist Otto 
Schindewolf (1896—1971) that “the first bird hatched from a reptilian 
egg” and, thus, in Goldschmidt’s words, “that the many missing links in 
the paleontological record are sought for in vain because they have 
never existed.”* If a bird hatched directly from a reptilian egg as the 
result of heritable, large-scale mutations, then such a sudden leap or 
“saltation” would obviously leave no fossil intermediates behind. 

Neo-Darwinists rejected this idea as biologically implausible in the 
extreme. They argued that changing so many functionally integrated 
anatomical and physiological systems so quickly would inevitably result 


in deformed mutants, not different integrated systems of organs 
constituting a whole new animal.* Goldschmidt’s macromutations, they 
contended, would produce not what Goldschmidt called “hopeful 
monsters,” but “hopeless monsters”—that is, nonviable organisms.” 

Though Gould wanted to reconsider a role for large-scale mutations, 
he carefully disassociated his proposal from Goldschmidt’s much 
ridiculed idea. Instead, he suggested that the mutations affecting genes 
in animal development might generate larger increments of 
morphological innovation than the mutations that affected other genes. 
These “developmental mutations,” he thought, might generate modular 
parts of biological systems in a short time—without needing to generate 
whole new forms of life in a single generation. He offered, as an 
example, the possibility that the gill arch bones of ancient jawless fish, 
though not the whole fish, might have arisen in one step as the result of 
a developmental macromutation. Gould explained: “I do not refer to the 
saltational origin of entire new designs, complete in all their complex 
and integrated features. . . . Instead, I envisage a potential saltational 
origin for the essential features of key adaptations.” 

In response to heavy criticism from neo-Darwinists, Gould later 
downplayed the role of such larger-scale developmental mutations in the 
theory of punctuated equilibrium. Other evolutionary biologists, 
however, took his idea as an inspiration and developed theories that 
emphasize such developmental mutations as a driving force in 
macroevolution. Evolutionary theorists and developmental biologists 
such as Rudolf Raff, Sean B. Carroll, and Wallace Arthur have 
developed a subdiscipline of biology known as evolutionary 
developmental biology, or “evo-devo” for short. The evolutionary 
developmental biologists have since formulated alternative models that 
challenge a key aspect of the neo-Darwinian triad. Whereas neo- 
Darwinism envisions new form arising as the result of slow, incremental 
accumulations of minor mutations, evolutionary developmental 
biologists argue that mutations affecting genes involved in animal 
development can cause large-scale morphological change and even 
whole new body plans. 


This chapter will not only examine the ideas of evolutionary 
developmental biologists, but three of the other most prominent 
alternatives to neo-Darwinism, some proposed by members of the 
Altenberg 16 (see Fig. 16.1). Each of these alternatives emphasizes 
certain elements of the “triad” at the expense of others. Whereas the 
self-organizational alternatives that I discussed in the last chapter 
emphasize the role of law-like processes over random mutations, these 
other new theories reaffirm the importance of mutations, though each 
also reconceptualizes how mutations act. One approach falls under the 
rubric of “evo-devo” and conceives of mutations producing 
modifications in larger increments. Another, the neutral theory of 
evolution, sees mutations acting absent selection. Another, neo- 
Lamarckian “epigenetic inheritance,” envisions heritable alterations in 
epigenetic information influencing the future course of evolution. Still 
another, called “natural genetic engineering,” affirms that nonrandom 


genetic rearrangements drive evolutionary innovation.” 
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FIGURE 16.1 


The tenets of neo-Darwinian orthodoxy and the different ways that 
various non-Darwinian models of evolution deviate from those tenets. 
The boxes representing new evolutionary models are positioned under 
the headings of the neo-Darwinian tenets that they challenge. 


Let’s see if one of these proposals solves the twin problems of the 
origin of form and information and whether, therefore, it might also help 
to resolve the mystery of the Cambrian explosion.® 


EVO-DEVO AND ITS PROPOSALS 


The neo-Darwinian synthesis has long emphasized that large-scale 
macroevolutionary change occurs as the inevitable by-product of the 
accumulation of small-scale “microevolutionary” changes within 
populations. The consensus in support of this idea began to fray in 
evolutionary biology during the early 1970s, when young 
paleontologists such as Gould, Niles Eldredge, and Steven Stanley 
realized that the fossil record did not show a pattern of gradual “micro- 
to-macro” change. In 1980, at a now famous symposium on 
macroevolution at the Field Museum in Chicago, the rebellion burst into 
full view, exposing what developmental biologist Scott Gilbert called 
“an underground current in evolutionary theory” among theorists who 
had concluded that “macroevolution could not be derived from 
microevolution.”” 

At the conference, paleontologists who doubted the “micro-to- 
macro” consensus found allies among younger developmental 
biologists. They were dissatisfied with neo-Darwinism in part because 
they knew that population genetics, its mathematical expression, sought 
only to quantify changes in gene frequency rather than explain the 
origin of genes or novel body plans. Thus, many developmental 
biologists thought that neo-Darwinism did not offer a compelling theory 
of macroevolution.!° 

To formulate a more robust theory, many developmental biologists, 
such as Rudolf Raff, a developmental biologist at the University of 


Indiana and one of the founders of “evo-devo,” urged evolutionary 
theorists to incorporate insights from their discipline.!! For example, 
developmental biologists know that mutations expressed early in the 
development of animals are necessary to alter body-plan 
morphogenesis. Thus, they argue that these mutations must have played 
a significant role in generating whole new animal forms during the 
history of life. They assert that this understanding of developmental 
processes is crucial to understanding animal evolution. Some evo-devo 
advocates such as Sean B. Carroll and Jeffrey Schwartz have pointed 
specifically to homeotic (or Hox) genes—master regulatory genes that 
affect the location, timing, and expression of other genes—as entities 
capable of producing such large-scale change in animal form.!* These 
evo-devo advocates have broken with classical neo-Darwinism 
primarily in their understanding of the size or increment of mutational 
change. 


MAJOR BUT NOT VIABLE, VIABLE BUT NOT MAJOR 


Despite the enthusiasm surrounding the field, evo-devo fails, and for an 
obvious reason: its main proposal, that early-acting developmental 
mutations can cause stably heritable, large-scale changes in animal body 
plans, contradicts the results of one hundred years of mutagenesis 
experiments.!> As we saw in Chapter 13, the experiments of scientists 
such as Nusslein-Volhard and Wieschaus have shown definitively that 
early-acting body-plan mutations invariably generate embryonic lethals 
—dead animals incapable of further evolution. The results of these 
experiments have generated the dilemma for evolutionary biologists that 
geneticist John McDonald aptly described as the “great Darwinian 
paradox.” Recall that McDonald noted that early-acting regulatory 
mutations do not produce viable alterations in form that will persist in 
populations, as evolution absolutely requires. Instead, these mutations 
are eliminated immediately by natural selection because of their 
invariably destructive consequences. On the other hand, later-acting 
mutations can generate viable changes in the features of animals, but 


these changes do not affect global animal architectures. This generates a 
dilemma: major changes are not viable; viable changes are not major. In 
neither case do the kinds of mutation that actually occur produce viable 
major changes of the kind necessary to build new body plans. 

In 2007, I coauthored a textbook with several colleagues titled 
Explore Evolution. In it, we explained this “either/or” (“mayjor-not- 
viable, viable-not-major”’) dilemma and suggested that it posed a 
challenge to theories that rely on the mutation and selection mechanism 
to explain the origin of major morphological changes.'* The National 
Center for Science Education (NCSE)—an influential activist group that 
opposes allowing students to learn about scientific criticisms of 
evolutionary theory—challenged our critique. They charged that our 
textbook “fails to acknowledge the extensive research on mutations in 
DNA sequences that do not encode proteins, but which have important 
morphological effects.”!> In other words, they claimed that some viable 
mutations do produce major large-scale changes. 
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FIGURE 16.2 

Superficial changes in insect wing coloration thought to be caused by 
mutations in cis-regulatory elements. Such examples show that 
mutations that affect development and that also result in viable offspring 
tend to be minor. Courtesy National Academy of Sciences, U.S.A. 


The NCSE cited papers from the “evo-devo” literature claiming that 
a type of mutation in the regulatory regions of the genome, “cis- 
regulatory” regions, have been shown to produce large-scale changes in 
winged insects. According to the NCSE, mutations in these cis- 
regulatory elements (or CREs) are “considered by many evolutionary 
biologists to have the greatest potential for generating evolutionary 
change.”!© What’s more, they insisted that “mutations in CREs play an 
important role in morphological evolution.”!’ The NCSE cited a paper 
in the Proceedings of the National Academy of Sciences by three 


developmental biologists, Benjamin Prud’homme, Nicolas Gompel, and 
Sean B. Carroll.!® 

The paper did not show what the NCSE claimed, however. It did 
assert that changes in regulatory DNA produce “both relatively modest 
morphological differences among closely related species and more 
profound anatomical divergences among groups at higher taxonomical 
levels.”!? But the study only showed how changes in the cis-regulatory 
elements in fruit fly DNA might have affected the coloration of wing 
spots in several different types of flying insects. It did not report any 
significant change in the form or body plan of these insects. Instead, the 
study highlighted a clear case of a viable mutation generating merely a 
minor or superficial change (see Fig. 16.2). 

Not surprisingly, many evolutionary biologists recognize that such 
regulatory mutations do not explain the evolution of new body plans. 
For example, Hopi Hoekstra, of Harvard University, and Jerry Coyne, 
two traditional neo-Darwinists, have published an article reviewing 
various evo-devo proposals in the journal Evolution. They note, 
“Genomic studies lend little support to the cis-regulatory theory” of 
evolutionary change. 

They also argue, tellingly, that most cis-regulatory mutations result 
in the Joss of genetic and anatomical traits, including a famous case in 
which evolutionary biologists attributed the loss of pelvic spines in 
stickleback fish to mutations in cis-regulatory elements.*° Yet, as they 
argue, “supporting the evo-devo claim that cis-regulatory changes are 
responsible for morphological innovations requires showing that 
promoters are important in the evolution of new traits, not just the losses 
of old ones.” Hoekstra and Coyne conclude, “There is no evidence at 
present that cis-regulatory changes play a major role—much less a pre- 
eminent one—in adaptive evolution.”*! Given their commitment to neo- 
Darwinism, it’s fair to assume that Hoekstra and Coyne probably did not 
intend, in making this argument, to refute the NCSE’s criticism of our 
textbook Explore Evolution. Nevertheless, science, like politics, 
sometimes makes for strange bedfellows. 





WHAT ABOUT HOX GENES? 


When biology students hear my colleague Paul Nelson describe the 
“great Darwinian paradox” (see Chapter 13) in public lectures on 
university campuses, they often ask, “What about Hox genes?” Recall 
that Hox (or homeotic) genes regulate the expression of other protein- 
coding genes during the process of animal development. Some 
biologists have likened them to the conductor of an orchestra who plays 
the role of coordinating the contributions of the players. And because 
Hox genes affect so many other genes, many evo-devo advocates think 
that mutations in these genes can generate large-scale changes in form. 

For example, Jeffrey Schwartz, at the University of Pittsburgh, 
invokes mutations in Hox genes to explain the sudden appearance of 
animal forms in the fossil record. In his book Sudden Origins, Schwartz 
acknowledges the discontinuities in the fossil record. As he notes, “We 
are still in the dark about the origin of most major groups of organisms. 
They appear in the fossil record as Athena did from the head of Zeus— 
full-blown and raring to go, in contradiction to Darwin’s depiction of 
evolution as resulting from the gradual accumulation of countless 
infinitesimally minute variations.” 

What resolves this mystery? Schwartz, an evo-devo advocate, 
reveals his answer: “A mutation affecting the activity of a homeobox 
[Hox] gene can have a profound effect—such as turning . . . larval 
tunicates into the first chordates. Clearly, the potential homeobox genes 
have for enacting what we call evolutionary change would seem to be 
almost unfathomable.” 

But can mutations in Hox genes transform one form of animal life— 
one body plan—into another? There are several reasons to doubt that 
they can. 

First, precisely because Hox genes coordinate the expression of so 
many other different genes, experimentally generated mutations in Hox 
genes have proven harmful. William McGinnis and Michael Kuziora, 
two biologists who have studied the effects of mutations on Hox genes, 
have observed that in fruit flies “most mutations in homeotic [Hox] 


genes cause fatal birth defects.” In other cases, the resulting Hox 
mutant phenotype, while viable in the short term, is nonetheless 
markedly less fit than the wild type. For example, by mutating a Hox 
gene in a fruit fly, biologists have produced the dramatic Antennapedia 
mutant, a hapless fly with legs growing out of its head where the 
antennae should be (see Fig. 16.3). Other Hox mutations have 
produced fruit flies in which the balancers (tiny structures behind wings 
that stabilize the insect in flight, called “halteres”) are transformed into 
an extra pair of wings.*° Such mutations alter the structure of the 
animal, but not in a beneficial or permanently heritable way. The 
Antennapedia mutant cannot survive in the wild; it has difficulty 
reproducing, and its offspring die easily. Similarly, fruit-fly mutants 
sporting an extra set of wings lack the musculature to make use of them 
and, absent their balancers, cannot fly. As Hungarian evolutionary 
biologist Eors Szathmary notes with cautious understatement in the 
journal Nature, “macromutations of this sort [1.e., in Hox genes] are 
probably frequently maladaptive.”/ 





FIGURE 16.3 

Photograph of an Antennapedia mutant with a pair of legs growing out 
of its head, where antennae would normally develop. Such examples 
show that mutations that occur early in animal development and that 
also produce major changes typically result in less fit offspring—in this 
case offspring that cannot reproduce. Courtesy Elsevier, Inc. 


Second, Hox genes in all animal forms are expressed after the 
beginning of animal development, and well after the body plan has 
begun to be established. In fruit flies, by the time that Hox genes are 
expressed, roughly 6,000 cells have already formed, and the basic 
geometry of the fly—its anterior, posterior, dorsal, and ventral axes—is 
already well established.2® So Hox genes don’t determine body-plan 
formation. Eric Davidson and Douglas Erwin have pointed out that Hox 
gene expression, although necessary for correct regional or local 
differentiation within a body plan, occurs much later during 
embryogenesis than global body-plan specification itself, which is 
regulated by entirely different genes. Thus, the primary origin of animal 
body plans in the Cambrian explosion is not merely a question of Hox 
gene action, but of the appearance of much deeper control elements— 
Davidson’s “developmental gene regulatory networks” (dGRNs).7? And 
yet, as we saw in Chapter 13, Davidson argues that it is extremely 
difficult to alter dGRNs without damaging their ability to regulate 
animal development. 

Third, Hox genes only provide information for building proteins that 
function as switches that turn other genes on and off. The genes that 
they regulate contain information for building proteins that form the 
parts of other structures and organs. The Hox genes themselves, 
however, do not contain information for building these structural parts. 
In other words, mutations in Hox genes do not have all the genetic 
information necessary to generate new tissues, organs, or body plans. 

Nevertheless, Schwartz argues that biologists can explain complex 
structures such as the eye just by invoking Hox mutations alone. He 
asserts that “[t]here are homeobox genes for eye formation and that 
when one of them, the Rx gene in particular, is activated in the right 
place and at the right time, an individual has an eye.”?? He also thinks 
that mutations in Hox genes help arrange organs to form body plans. 

In a review of Schwartz’s book, Eörs Szathmary finds Schwartz’s 
reasoning deficient. He too notes that Hox genes don’t code for the 
proteins out of which body parts are made. It follows, he insists, that 
mutations in Hox genes cannot by themselves build new body parts or 


body plans. As he explains, “Schwartz ignores the fact that homeobox 
genes are selector genes. They can do nothing if the genes regulated by 
them are not there.”*! Though Schwartz says he has “marveled” at “the 
importance of homeobox genes in helping us to understand the basics of 
evolutionary change,”** Szathmary doubts that mutations in these genes 
have much creative power. After asking whether Schwartz succeeds in 
explaining the origin of new forms of life by appealing to mutations in 
Hox genes, Szathmary concludes, “I’m afraid that, in general, he 
does not.’””*? 

Nor, of course, do Hox genes possess the epigenetic information 
necessary for body-plan formation. Indeed, even in the best of cases 
mutations in Hox genes still only alter genes. Mutations in Hox genes 
can only generate new genetic information in DNA. They do not, and 
cannot, generate epigenetic information. 

Instead, epigenetic information and structures actually determine the 
function of many Hox genes, and not the reverse. This can be seen when 
the same Hox gene (as determined by nucleotide sequence homology) 
regulates the development of different anatomical features found in 
different phyla. For instance, in arthropods the Hox gene Distal-less is 
required for the normal development of jointed arthropod legs. But in 
vertebrates a homologous gene (e.g., the D/x gene in mice) builds a 
different kind of (nonhomologous) leg. Another homologue of the 
Distal-less gene in echinoderms regulates the development of tube feet 
and spines—anatomical features classically thought not to be 
homologous to arthropod limbs, nor to limbs of tetrapods.** In each 
case, the Distal-less homologues play different roles determined by the 
higher-level organismal context. And since mutations in Hox genes do 
not alter higher-level epigenetic contexts,’ they cannot explain the 
origin of the novel epigenetic information and structure that establishes 
the context and that is necessary to building a new animal body plan.*° 


NEUTRAL OR NONADAPTIVE EVOLUTION 


Michael Lynch, a geneticist at Indiana University, has offered a different 
mechanism of evolutionary change, and a different explanation for the 
origin (or growth) of the genome as well as the origin of anatomical 
novelty. Lynch proposes a neutral or “non-adaptive” theory of evolution 
in which natural selection plays a largely insignificant role. His theory is 
based on contrasting observations about the features and strength of 
evolutionary mechanisms at work in populations of different sizes. 

He observes, first, that in general, the larger the population of 
organisms, the lower the mutation rate and (in sexually reproducing 
eukaryotes) the higher the rate of genetic recombination. He notes that 
the genomes of organisms in larger populations (such as those of 
bacteria and unicellular eukaryotic organisms) tend to be smaller and 
more streamlined—meaning they have fewer intervening nonprotein- 
coding sequences (i.e., introns). Most important, he notes that in large 
populations, natural selection tends to be relatively effective in 
eliminating deleterious mutations and fixing beneficial ones, whereas 
the process of genetic drift (the tendency for gene variants to be lost 
through random processes) plays a relatively less significant role. 

By contrast, Lynch observes that small populations—which would 
include almost all animal groups—are characterized by higher mutation 
rates and lower rates of genetic recombination. They also tend to have 
large genomes with a lot of nonprotein-coding DNA—introns, 
pseudogenes, transposons and various repetitive DNA elements—as 
well as gene duplicates. Lynch argues that in these small populations, 
natural selection tends to be weak—unable to remove mildly deleterious 
mutations or to fix mildly beneficial ones efficiently. As Lynch 
summarizes, “Three factors (low population sizes, low recombination 
rates and high mutation rates) conspire to reduce the efficiency of 
natural selection with increasing organism size.”>’ Consequently, 
nonprotein-coding elements are not removed from the genome, but 
instead tend to accumulate, causing the genomes of organisms living in 
small populations to grow—even though these sequences may be 
neutral or even deleterious. Moreover, in small populations, “neutral” 


processes such as random mutation, genetic recombination, and genetic 
drift predominate in their effects over natural selection. 

What does all this have to do with the origin of animals and the 
Cambrian explosion? Evolutionary biologists think that the ancestral 
groups of the Cambrian animals would likely have existed in relatively 
small populations. Lynch argues that in small populations, animal 
genomes will inevitably grow over time as nonprotein-coding sections 
of DNA (as well as gene duplicates) accumulate due to the weakness of 
natural selection. He thinks these neutral mutations drive the evolution 
and growth of genomic and phenotypic complexity in animals. In short, 
Lynch attempts to explain the expansion of the genome and the origin of 
anatomical complexity as the result of neutral, non-adaptive processes 
of genetic accretion, rather than as an adaptive process involving natural 
selection acting on random mutations. As he states, “Many of the 
unique complexities of the eukaryotic gene arose by semi-neutral 
processes with little, if any, direct involvement of positive selection.”*® 

In his work, Lynch has also advanced a powerful mathematical 
critique of the efficacy of the neo-Darwinian mechanism. He has argued 
that natural selection plays a lesser role in shaping the features of 
evolving populations than many evolutionary theorists have previously 
assumed—especially in the case of relatively small populations. Lynch 
has argued, instead, that random environmental factors—an organism 
being in the right place at the right time (near an abundant food source, 
for example) or in the wrong place at the wrong time (in a drought- 
stricken region or near an erupting volcano, for example)—will play a 
more important role in determining reproductive success than variations 
in the fitness of organisms within the same population. 


COUNTERINTUITIVE APPROACH 


Lynch develops his mathematical critique of the creative power of 
natural selection based on the principles of population genetics. 
Nevertheless, it does not follow from his analysis showing the weakness 
of natural selection that neutral processes alone are sufficient to build 


new functional genes and proteins. Nor does it follow that neutral 
processes alone can account for the many complex anatomical systems 
that require new sources of genetic (and epigenetic) information for 
their construction. Indeed, it may seem counterintuitive, at least from a 
neo-Darwinian point of view, to think that the accumulation of random 
mutations alone can accomplish what neo-Darwinists have long invoked 
both mutations and natural selection to do. In effect, Lynch’s theory 
attempts to explain the origin of anatomical complexity by reference to 
what would seem on its face to be a less—not a more—potent 
mechanism than the one offered by neo-Darwinism. Could such a 
counterintuitive theory be correct? 

Perhaps, but as a comprehensive theory of how biological 
information and anatomical complexity arises, Lynch’s neutral theory 
leaves much to be desired. 

In the first place, Lynch’s theory offers no explanation for some of 
the crucial molecular machinery present in eukaryotes—machinery that 
is necessary to rendering his mechanism for the accumulation and 
subsequent expression of genetic information credible. Recall that 
Lynch thinks that small populations of multi-cellular organisms in 
particular would have inevitably accumulated many insertional genomic 
elements. But for the functional information in these growing genomes 
to be expressed, the cell must have some way of excising the non- 
functional randomly accreting genetic elements—at least, until some of 
them mutate to the point that they contribute to producing functional 
genes and proteins. 

Extant eukaryotic organisms depend on a sophisticated molecular 
machine called a spliceosome—a machine that excises introns and fuses 
together exons (the portions of the genome that code for proteins) 
before gene expression takes place. “This large complex,” observes cell 
biologist Melissa Jurica, is “composed of over 150 individual proteins” 
and several structural RNAs, and thus “may indeed deserve the moniker 
‘the most complicated macromolecular machine in the cell.’ °° 

So where do spliceosomes, and the genes necessary to produce them, 
come from? Lynch doesn’t say, though he recognizes, of course, the 


importance of this molecular machinery to gene expression and to his 
scenario. As he explains, “The problem is that introns are inside genes 
and get transcribed to mRNA but then have to be spliced out perfectly. 
If you’re one nucleotide off, you get a dead transcript.’”*? Nevertheless, 
Lynch’s theory presupposes, but does not explain, the origin of the 
genetic information necessary to produce the spliceosomes that perform 
this function. He certainly does not explain the origin of these massive 
multi-protein, multi-RNA complexes by reference to any neutral 
evolutionary process. Nor can he, since his theory of genomic accretion 
and expression presupposes the existence of precisely such intricate 
machines. Instead, as my colleague Paul Nelson has put it rather 
colorfully, “to get Lynch’s theory of genomic accretion up and running, 
a great deal of complicated molecular machinery must be rolled in from 
offstage.” 

Of course, it could be argued that these machines and systems arose 
much earlier with the origin of the eukaryotic cell as the result of 
selection-driven evolution in the large populations of simpler unicellular 
organisms in which, according to Lynch’s theory, natural selection 
played a more significant role. Nevertheless, Lynch does not make that 
argument—and for good reason. Most evolutionary biologists today 
recognize the origin of the eukaryotic cell as a completely unsolved 
problem—unexplained by either neutral or adaptive theories of 
evolution?! 

Of course, insofar as these molecular machines are present in even 
one-celled eukaryotic organisms, they would have arisen, presumably, 
well before the origin of animals. Thus, explaining their origin is not, 
strictly speaking, directly relevant to explaining the Cambrian 
explosion. Nevertheless, Lynch’s inability to account for their origin 
reflects directly on the credibility of his theory—at least insofar as it 
seeks to offer a comprehensive account of the mechanisms by which 
biological information and complexity arise during the history of life. 


Dri fing In and Out 


In any case, there are good reasons to doubt that Lynch’s neutral 
mechanism could generate the novel biological information and form 
necessary to explain the origin of animals, even granting the prior 
existence of the molecular machinery (in small populations of 
eukaryotic organisms) that his scenario requires. 

First, Lynch assumes a false gene-centric view of the origin of 
biological form. As he writes: “Most of the phenotypic diversity that we 
perceive in the natural world is directly attributable to the peculiar 
structure of the eukaryotic gene.” His view overlooks the crucial role 
of epigenetic information and structure in the origin of animal form 
discussed in Chapter 14 and, therefore, does nothing to explain its 
origin. 

Second, neutral processes such as genetic drift do not favor 
beneficial mutations, and thus do not fix, with any efficiency, those 
mutation-induced genetic traits in small populations.** Natural 
selection, as we saw in Chapter 10, is something of a double-edged 
sword. On the one hand, natural selection helps to fix beneficial traits in 
a population. On the other hand, natural selection also makes it difficult 
for functional genes to vary widely without being eliminated. Neutral 
theories of evolution attempt to avoid the latter problem by invoking 
gene duplication and other processes that can add nonfunctional 
sequences to the genome—sequences that are unaffected, at least 
initially, by selective pressure. In so doing, however, these theoretical 
formulations significantly diminish the role of natural selection as a 
mechanism that can fix beneficial mutations in place once they have 
arisen. Thus, in all neutral theories, including Lynch’s, any beneficial 
mutations that arise and begin to drift through a population, can just as 
readily—without a significant influence from natural selection to 
impede it—drift out of a population as well. This limitation vastly 
increases the time it will take for neutral processes to fix beneficial 
genetic changes in a population. Both skeptics and proponents of neo- 
Darwinism have recognized this deficiency in Lynch’s model.“ 

Third, and most important, Lynch’s theory not only fails to account 
for the fixation of new genes and traits in small populations, it also fails 


to account for their origin. Lynch’s mechanism of neutral mutational 
change does envision the addition of brute genomic complexity as the 
result of the accretion of preexisting genetic elements (introns, 
transposons, pseudogenes and gene duplicates). Nevertheless, the 
addition of these elements does not generate any novel functional (or 
specified) genetic information. Instead, it merely transfers preexisting 
genetic sequences from one organismal context where those sequences 
may have performed a function, to another where they likely will not. 
Indeed, the point of neutral theory is to postulate the addition of genetic 
elements that, initially, do not perform crucial functions such that they 
can experience mutations without deleterious consequence to the 
organism. Lynch himself assumes that these added elements will not 
perform functions in their new context, which is why he envisions the 
need for spliceosomes to excise them, at least initially. 

Instead, for Lynch’s theory to explain the origin of new and 
functional genes and proteins (and the anatomical complexities that 
depend on them), his theory would have to solve the problem of 
combinatorial inflation discussed in Chapter 10. He would have to show 
that purely random mutations could efficiently search the relevant 
combinatorial space of possible sequences corresponding to a given 
novel functional gene or protein. 

Nevertheless, Lynch does not even address the problem of 
combinatorial inflation or the closely related problem of the rarity of 
genes and proteins in sequence space. He provides no experimental 
evidence that recombination and/or mutation (given genetic drift) will 
actually produce functional or specified genetic complexity. Instead, the 
examples he provides are entirely hypothetical. In addition, he offers no 
reason to think that the probability of a successful search for functional 
genes or proteins would be any higher (1.e., more likely to occur) than 
the probabilities calculated in Chapter 10. He does not, therefore, 
answer the challenge of the problem of combinatorial inflation and the 
rarity of functional genes and proteins in sequence space. 

Lynch does provide, perhaps, a more detailed characterization than 
other neutral theories of where neutral, nonadaptive processes must 


predominate. Nevertheless, he does not show that such processes— 
random genetic mutations unhinged from natural selection—are 
sufficient to generate novel functional genes and proteins, let alone 
complex anatomical novelties requiring the origin of many such genes 
and proteins. Instead, as Axe’s experimental results have shown, 
random mutations of whatever kind will not generate enough trials to 
render probable (or plausible) a successful search of the sequence space 
corresponding to a given functional gene or protein. 


LYNCH AND WAITING TIMES 


Lynch does argue in one paper that neutral evolutionary processes can 
generate new complex adaptations—adaptations requiring multiple 
coordinated mutations—within realistic waiting times. In particular, 
writing in a recent paper with colleague Adam Abegg of St. Louis 
University, he argues that “conventional population genetic 
mechanisms” such as random mutation and genetic drift can cause the 
“relatively rapid emergence of specific complex adaptations.”“? Lynch 
makes two specific claims in this regard. First, he claims that in large 
populations, arbitrarily complex adaptations can occur if the mutational 
intermediates are neutral in their effects on the organism. That is, Lynch 
purports to show that in large microbial populations, complex 
adaptations requiring a virtually unlimited number of mutations can 
occur within realistic waiting times. According to Lynch, this can occur 
provided that each mutation in a series of mutations has neutral (but not 
deleterious) effects on the organism. Second, Lynch argues that even 
though it generally takes longer to build complex traits in small 
populations, such traits can still evolve within realistic waiting times 
provided, again, that the mutational intermediates are neutral in their 
effects. In fact, he concludes that “the elevated power of both random 
genetic drift and mutation may enable the acquisition of complex 
adaptations in multicellular species at rates that are not appreciably 
different from those achievable in enormous microbial populations.” 


Though Lynch makes these assertions in the context of a densely 
mathematical scientific article, the significance of his claims, if true, can 
hardly be overstated. In essence, he claims that his population genetics— 
based mathematical model shows that purely random mutations and 
genetic drift can generate extremely complex adaptations in realistic 
waiting times—that his neutral evolutionary theory solves the problem 
of complex adaptations and long expected waiting times discussed in 
Chapter 12. 

But some things are just too good to be true, and it turns out that 
Lynch and Abegg made a subtle but fundamental mathematical error in 
coming to their conclusion. Appropriately, perhaps, the first person to 
demonstrate that Lynch’s incredible claim was problematic was Douglas 
Axe. Although Axe could see that much of the math in Lynch’s paper 
with Abegg was correct, Axe suspected from his own calculations and 
experiments that they had made some crucial error. In the end, he traced 
Lynch and Abegg’s claims to two erroneous equations, both of which 
were based on an erroneous assumption. In essence, Lynch and Abegg 
assumed that organisms will acquire a given complex adaptation by 
traversing a direct path to the new anatomical structure. Each mutation 
would build on the previous one in the most efficient manner possible— 
with no setbacks, false starts, aimless wandering, or genetic degradation 
—until the desired structure or system (or gene) is constructed. Thus, 
they formulated an undirected model of evolutionary change, and one 
that assumes, moreover, that there is no mechanism available (such as 
natural selection) that can lock in potentially favorable mutational 
changes on the way to some complex advantageous structure. 
Nevertheless, they calculated the waiting times required to produce such 
structures as if a process for locking in potentially advantageous 
changes did exist, and as if their undirected and purely random 
mechanism was in some way directed to these functionally propitious 
outcomes. As Axe notes in a trenchant mathematical critique of Lynch 
and Abegg’s argument, “Of all the possible evolutionary paths a 
population can take, the analysis of Lynch and Abegg considers only 


those special paths that lead directly to the desired end—the complex 
adaptation.’ 

Yet nothing in Lynch’s neutral model ensures that potentially 
advantageous mutations will remain in place while other mutations 
accrue. As Axe explains, “Productive changes cannot be ‘banked’, 
whereas Equation 2 [one of Lynch’s equations] presupposes that they 
can.” Instead, Axe shows, mathematically, that degradation (the 
fixation of mutational changes that make the complex adaptation less 
likely to arise) will occur much more rapidly than constructive 
mutations, causing the expected waiting time to increase exponentially. 

The illustration I developed in Chapter 10 to explain the problem 
facing neutral models of gene evolution may help illuminate the 
mistaken assumption underlying Lynch and Abegg’s calculations. 
Recall from that chapter that my hypothetical blindfolded man dropped 
in the middle of an enormous shark-free pool did not face any predators 
(by analogy the purifying effects of natural selection). But he still faced 
the problem of finding the ladder at the edge of the enormous body of 
water (by analogy, the need to search an enormous number of possible 
mutational paths and possible sequences to find the rare functional 
ones). Now suppose someone were to calculate how long on average it 
would take for the blindfolded man to swim to the ladder at the edge of 
the enormous pool. If someone simply divided the distance to the ladder 
at the edge of the pool by the maximum speed that man could swim, he 
or she would get a fantastically optimistic estimate of the severity of the 
problem facing our unfortunate swimmer. Why? Because calculating the 
probable waiting time in this manner would overlook the main problem 
the man faced, namely, the man does not know where the ladder is or 
how to get there. Nor does he have any way to gauge his progress. 

Thus, any realistic estimate of how long it will actually take him to 
swim to the ladder—as opposed to an estimate of the theoretically 
fastest route possible—must take into account his probable aimless 
wandering, fits and starts, swimming in circles and drifting in various 
directions. Similarly, Lynch and Abegg fail to reckon in their calculation 
on the random, undirected and, literally, aimless nature of the 


mechanism that they propose. Instead, they mistakenly assume that 
neutral processes of evolution will make a beeline for some specific 
complex adaption. In fact, these processes will—in all probability—also 
wander aimlessly in a vast sequence space of neutral, functionless 
possibilities with nothing to direct them, or preserve them in any 
forward progress they happen to make, toward the rare and isolated 
islands of function represented by complex adaptations. For this reason, 
Lynch vastly underestimates the waiting times required to generate 
complex adaptations and, therefore, does not solve the problem of the 
origin of genes and proteins or any other complex adaptation. Instead, 
Axe shows in his own mathematical model, which accompanies his 
critique of Lynch, that the waiting times problem is just as severe as he 
(Axe) had previously calculated. 


NEO-LAMARCKIAN EPIGENETIC INHERITANCE 


The third element of the neo-Darwinian triad concerns the transmission 
and inheritance of genetic information. Not surprisingly, a new version 
of evolutionary theory questions the neo-Darwinian understanding of 
heredity as well. 

Darwin himself lacked an accurate theory of how features of 
organisms are transmitted from one generation to the next. He thought 
that changes in organisms that occurred during their lifetimes, as the 
result of the “use and disuse” of different organs and anatomical 
systems, would be transmitted to offspring through reproduction.*? In 
this respect, his theory of inheritance resembled that of an earlier 
evolutionary theorist, Jean-Baptiste de Lamarck (1744-1829), who also 
believed in the inheritance of acquired characteristics. 

Lamarckian mechanisms, although unsupported by any evidence at 
the time, came to play an increasingly important role in Darwin’s 
thinking, as criticisms of natural selection caused Darwin to place more 
weight on the direct influence of the environment in evolutionary 
change. Indeed, by the sixth edition of the Origin (1872), Darwin 
specifically emphasized the importance of these modes of inheritance.>? 


But with the rediscovery of Mendel’s laws in 1900 and the 
identification of chromosomes as the material entity responsible for the 
transmission of inheritance, Lamarckian theories of inheritance fell out 
of favor. Following the rise of the neo-Darwinian synthesis, most 
evolutionary biologists came to regard the gene as the locus of all 
heritable change in the organism. And after 1953, biologists equated the 
gene with specifically arranged nucleotide bases within the DNA 
molecule. 

Recently, however, as more biologists have recognized that some 
biological information—epigenetic information—tresides in structures 
outside of DNA, interest has grown in the possibility that these 
nongenetic sources of information may influence the course of 
evolution. The discovery that epigenetic information can be altered and 
directly inherited independently of DNA has attracted further attention. 
This discovery has, in turn, led to the formulation of a contemporary 
“neo-Lamarckian”! theory that envisions changes in the nongenetic 
structures of an organism affecting subsequent generations. 

Today, prominent defenders of neo-Lamarckism include Eva 
Jablonka, of Tel Aviv University, and Massimo Pigliucci, of the City 
University of New York. Lamarck, of course, knew nothing about the 
role of genes and believed that inheritance of acquired characteristics 
was an important driving force in evolution. Modern neo-Lamarckians, 
fully apprised of the reality of genetic inheritance, nevertheless think 
that nongenetic sources of information and structure may play some role 
in the evolution of biological form. According to Jablonka, neo- 
Lamarckism “allow[ s] evolutionary possibilities denied by the ‘Modern 
Synthesis’ version of evolutionary theory, which states that variations 
are blind, are genetic (nucleic acid—based), and that saltational events do 
not significantly contribute to evolutionary change.” 

Jablonka has collected several categories of evidence in support of 
what she calls “epigenetic inheritance systems.” In the first place, in 
some single-celled organisms (such as Æ. coli and yeast) 
environmentally induced changes in metabolic pathways can be 
transmitted to the next generation independently of any changes in the 


cell’s DNA. Second, she notes that structural information mediating 
organismal form (and function) does pass from parent to offspring 
independently of DNA, via membranes and other three-dimensional 
cellular patterns. 

Third, she discusses the process of DNA methylation—a process in 
which special enzymes attach a methyl group (CH;) to nucleotide bases 


within the double helix. Processes like this can alter gene regulation and 
chromatin structure. Jablonka notes that the changes produced by 
processes that alter gene regulation are often transmitted to subsequent 
generations of cells without any changes to DNA base sequences. 
Finally, she cites a process called “RNA-mediated” epigenetic 
inheritance, a recently discovered phenomenon. Here, small RNAs, 
again acting in concert with special enzymes, affect gene expression and 
chromatin structure, and these modifications appear to be heritable 
independently of genes. 

Can any of these mechanisms help to explain the origin of animal 
form in the Cambrian explosion? Not really. 

By its nature, macroevolution requires stable—meaning permanently 
heritable—changes. But Jablonka’s evidence shows that where 
nongenetic inheritance occurs in animals, it involves structures that 
either (a) do not change (such as membrane patterns and other persistent 
templates of structural information), or (b) do not persist over more than 
several generations. And neither case generates significant evolutionary 
innovation in animal form. Instead, for directional evolutionary change 
to occur in a population of organisms, changes must not only be 
heritable, but permanent. Stability—the irreversible and enduring 
heritability of traits—is a logically inescapable requirement for any 
theory of evolution. This is precisely what “descent with modification” 
means. 

And here Jablonka’s evidence for stable nongenetic inheritance is 
equivocal at best, as she readily admits. Reviewing Jablonka’s 
assembled data for animals reveals no case where an induced epigenetic 
change persisted permanently in any population. The heritability of such 


changes is transient, lasting (depending on the species in question) from 
a few generations up to forty. 

Jablonka candidly addresses this lack of evidence for stability, 
noting, “We believe that epigenetic variants in every locus in the 
eukaryotic genome can be inherited, but in what manner, for how long, 
and under what conditions, has yet to be qualified.”>’ Consequently, 
despite its intriguing aspects, the evolutionary significance of neo- 
Lamarckian epigenetic inheritance remains uncertain or, in Jablonka’s 


own words, “inevitably, somewhat speculative.”°4 


NATURAL GENETIC ENGINEERING 


University of Chicago geneticist James Shapiro has formulated another 
post-Darwinian perspective on how evolution works that he calls 
“natural genetic engineering.” Shapiro has developed an understanding 
of evolution that takes account of the integrated complexity of 
organisms as well as the importance of nonrandom mutations and 
variations in the evolutionary process. 

He observes that organisms within a population often modify 
themselves in response to different environmental challenges. He cites 
evidence showing that when populations are challenged by 
environmental stresses, signals, or triggers, organisms do not generate 
mutations or make genetic changes randomly, that is, without respect to, 
or unguided by, their survival needs. Instead, they often respond to 
environmental stresses or signals by inducing mutations in a directed or 
regulated way. As he explains, “The continued insistence on the random 
nature of genetic change by evolutionists should be surprising for one 
simple reason: empirical studies of the mutational process have 
inevitably discovered patterns, environmental influences, and specific 
biological activities at the roots of novel genetic structures and altered 
DNA sequences.””>> 

The depth of Shapiro’s challenge to orthodox neo-Darwinism is 
profound. He rejects the randomness of novel variation that Darwin 
himself emphasized and that neo-Darwinian theorists throughout the 


twentieth century have reaffirmed.°° Instead, he favors a view of the 
evolutionary process that emphasizes preprogrammed adaptive capacity 
or “engineered” change, where organisms respond in a “cognitive” way 
to environmental influences, rearranging or mutating their genetic 
information in regulated ways to maintain viability. 

As an example, Shapiro notes that—contrary to the neo-Darwinian 
assumption that “DNA alterations are accidental”>’—all organisms 
possess sophisticated cellular systems for proofreading and repairing 
their DNA during its replication. He notes that these systems are 
“equivalent to a quality-control system in human manufacturing,” where 
the “surveillance and correction” functions represent “cognitive 
processes, rather than mechanical precision.”>® 

As an example of regulated mutation, Shapiro observes that in 
response to environmental assault—UV damage from sunlight or the 
presence of an antibiotic, for instance—bacteria activate what is known 
as the “SOS response” system. This system makes use of specialized 
error-prone DNA polymerases, normally left unexpressed, that are 
synthesized and set into action, allowing the population to generate a 
much wider range of genetic variation than usual. Bacterial cells 
regulate this process using a DNA-binding protein known as LexA, 
which normally represses the error-prone polymerases. When the SOS 
system is activated by environmental damage, the production of LexA 
first drops dramatically, allowing expression of the error-prone 
polymerases, but then rises, which “ensures that as soon as DNA repair 
occurs .. . LexA [will] reaccumulate and repress the SOS genes.” This 
system allows cells to “replicate DNA that carries unrepaired 
damage,”°? keeping their essential replication machinery moving past a 
stall, in the absence of which the bacterium would die. 

An analogy may help to illustrate what the cell is doing when 
confronted with an environmental challenge. Imagine a military unit, a 
combined armor and infantry battalion, crossing an open plain. 
Suddenly, the battalion falls under a fierce, unrelenting enemy artillery 
barrage, wounding many of its soldiers. To keep the wounded alive until 
the barrage ceases or reinforcements arrive, the commander instructs 


certain members of the unit with destructive skills to disassemble (in 
military jargon, “cannibalize”) a few of the tanks to provide temporary 
armored cover from further incoming shells. His order tells them, 
however, to cease their tank-modifying actions as soon as the barrage 
ends. That is, the unit as a whole tolerates “damage” to some of its 
equipment to save as many of its members as possible. 

In the same way, while at one level their “error-prone” role may 
appear counterintuitive, these mutation-generating DNA polymerases of 
the SOS system actually constitute essential hardware in the cell’s 
defensive armory.°! From Shapiro’s perspective, this survival strategy 
does not exemplify Darwinian randomness, but rather sophisticated 
preprogramming, an “apparatus that even the smallest cells possess” to 
maintain viability.°* What’s more, the carefully regulated expression of 
the SOS response provides evidence that cells employ the system only 
when needed.°? 

In addition to ramping up their mutation rates in specific sections of 
the genome, cells may also change the way they express the genetic 
information that they already carry, expressing some genes that were 
previously unexpressed and suppressing others. Organisms in 
populations under particular stresses may retrieve and access modular 
elements of genetic information stored in disparate locations on the 
genome or even on different chromosomes. Cells will then assemble, or 
concatenate, those modular elements to form a new gene or RNA- 
transcript capable of directing the synthesis of a novel protein or 
proteins that can help the organism survive. 

Shapiro argues that these and other kinds of directed, rather than 
random, genetic changes and responses to stimuli occur under 
“algorithmic control.” He describes the cell as “a powerful real-time 
distributed computing system’°* implementing various “if-then” 
subroutines. This emphatically challenges one of the three key elements 
of the neo-Darwinian triad: the claim that mutations and variations 
occur in a strictly random way. 

During the last fifteen years, Shapiro has published a series of 
fascinating papers about the newly discovered capacities of cells to 


direct or “engineer” the genetic changes they need to remain viable in a 
range of environmental conditions. His work represents a promising 
avenue of new biological research, bringing insight into how the cell’s 
information-processing system modifies and directs the expression of its 
genetic information in real time in response to different signals. 
Shapiro’s work also provides new insights into how observable 
evolutionary changes occur in living populations. 

Could it, then, also provide a solution to the problem of the origin of 
the information necessary to build an animal body plan? It could, except 
for one question that Shapiro’s otherwise brilliant characterization of 
how organisms modify themselves doesn’t address. 

Where does the programming come from that accounts for the 
“preprogrammed adaptive capacity” of living organisms? If, as James 
Shapiro argues, natural selection and exclusively random mutations 
don’t produce this information-rich pre-programming, then what did? In 
the next chapters, I’ll propose an answer to precisely this question. 


17 
The Possibility of Intelligent Design 


The owner of a remote island estate has been murdered while out riding. 
When the local sheriff arrives, he learns there are several obvious 
suspects: the volatile gamekeeper, the owner of a neighboring estate 
with whom the murder victim has had a long-running feud, and the 
estate owner’s estranged wife, who had been living on the island in a 
small mother-in-law cottage. The sheriff quickly learns the basic facts of 
the case. The victim was found dead, facedown on the beach, with his 
horse standing nearby. Any one of the three suspects could have taken a 
rifle, from an unlocked shed at the edge of the property. All were healthy 
enough to have hiked to the scene of the crime. Each of them has a 
motive. And none has an alibi. 

But as the investigation unfolds additional facts come to light. Most 
importantly, when the coroner arrives, he determines that although the 
victim was shot in the stomach and then his head was harshly 
bludgeoned by the butt of the rifle, these injuries served merely to 
conceal the bullet wound that actually killed the estate owner. The man 
was dead when he hit the ground. What killed him was a perfect shot 
entering the head just behind the right ear, exactly where an expert 
marksman would place a bullet. Moreover, ballistics shows that this 
bullet came from a different gun altogether from the one stored in the 
shed, a weapon likely fired from quite a distance. 

The sheriff then returns to the list of suspects and, one by one, 
eliminates them. Abundant evidence shows that none of the three prime 
suspects is a particularly good shot, much less a world-class marksman. 
The landowner’s estranged wife has a shaky hand and no experience 


with firearms. The volatile gamekeeper has extremely poor eyesight. 
And the neighboring landowner turns out to have an alibi after all—as 
well as a broken arm, which would have prevented him from holding the 
kind of rifle from which the bullet was fired. There is, however, one 
other person living on the estate, though not even the other suspects 
suspect him. He is the victim’s loyal and longtime personal assistant, a 
timorous older man much beloved by both the family and the other 
servants. No one wants to consider him as a possible suspect. But is it 
possible that he could have had something to do with the crime after all? 
Might an unexpected suspect—indeed “the butler’—have done it? 

Clearly, standard evolutionary theory has reached an impasse. Neither 
neo-Darwinism nor a host of more recent proposals (punctuated 
equilibrium, self-organization, evolutionary developmental biology, 
neutral evolution, epigenetic inheritance, natural genetic engineering) 
have succeeded in explaining the origin of the novel animal forms that 
arose in the Cambrian period. Yet all these evolutionary theories have 
two things in common: they rely on strictly material processes, and they 
also have failed to identify a cause capable of generating the information 
necessary to produce new forms of life. 

This raises a question. Is it possible that a different or unexpected 
kind of cause might provide a more adequate explanation for the origin 
of the new form and information—as well as the other distinctive 
features—present in the Cambrian explosion? In particular, is it possible 
that intelligent design—the purposeful action of a conscious and rational 
agent—might have played a role in the Cambrian explosion? 





INTRODUCING INTELLIGENT DESIGN 


When the case for intelligent design is made, it’s often hard to get 
contemporary evolutionary biologists to see why such an idea should 
even be considered or why discussions of design should play any role in 
biology at all. Though many biologists now acknowledge serious 
deficiencies in current strictly materialistic theories of evolution, they 


resist considering alternatives that involve intelligent guidance, 
direction, or design. 

Much of this resistance seems to come simply from not 
understanding what the theory of intelligent design is. Many 
evolutionary biologists see intelligent design as a religiously based idea 
—a form of biblical creationism. Others think the theory denies all 
forms of evolutionary change. But contrary to media reports, intelligent 
design is not a biblically based idea, but instead an evidence-based 
theory about life’s origins—one that challenges some, but not all, 
meanings of the term “evolution.” 

Perhaps the best way to explain the theory of intelligent design is to 
contrast it with the specific aspect of the theory of Darwinian evolution 
that it directly challenges. Recall from our opening discussion in Chapter 
1 that the term “evolution” has many different meanings and that 
Darwin’s theory of evolution by natural selection affirmed several of 
them: first, change over time; second, universal common descent; and 
third, the creative power of natural selection acting on random 
variations. In affirming this third meaning of evolution, both classical 
Darwinism and modern neo-Darwinism also affirm what neo-Darwinist 
Richard Dawkins has called the “blind watchmaker” hypothesis. This 
hypothesis holds that the mechanism of natural selection acting on 
random genetic variations (and mutations) can produce not just new 
biological form and structure, but also the appearance of design in 
living organisms. ! 

Darwin argued for this idea in The Origin of Species as well as in his 
letters. Recall the sheep breeding illustration from Chapter 1 where I 
described how both intelligent human breeders and environmental 
change (a series of bitterly cold winters) might produce an adaptive 
advantage in a population of sheep. During the nineteenth century, 
biologists regarded the adaptation of organisms to their environment as 
one of the most powerful pieces of evidence of design in the living 
world. By observing that natural selection had the power to produce 
such adaptations, Darwin not only affirmed that his mechanism could 
generate significant biological change, but that it could explain the 


appearance of design—without invoking the activity of an actual 
designing intelligence. In doing so, he sought to refute the design 
hypothesis by providing a materialistic explanation for the origin of 
apparent design in living organisms. Modern neo-Darwinists also affirm 
that organisms look as if they were designed. They also affirm the 
sufficiency of an unintelligent natural mechanism—mutation and natural 
selection—as an explanation for this appearance. Thus, in both 
Darwinism, and  neo-Darwinism, the _ selection/variation (or 
selection/mutation) mechanism functions as a kind of “designer 
substitute.” As the late Harvard evolutionary biologist Ernst Mayr 
explains: “The real core of Darwinism . . . is the theory of natural 
selection. This theory is so important for the Darwinian because it 
permits the explanation of adaptation, the ‘design’ of the natural 
theologian, by natural means.”* Or as another prominent evolutionary 
biologist, Francisco Ayala, has put it succinctly, natural selection 
explains “design without a designer.” 

Other contemporary neo-Darwinian biologists including Richard 
Dawkins, Francis Crick, and Richard Lewontin have also emphasized 
that biological organisms only appear to have been designed.* They 
recognize that many biological structures—whether the chambered 
nautilus, the compound eye of a trilobite, the electrical system of the 
mammalian heart, or numerous molecular machines—attract our 
attention because the sophisticated organization of such systems is 
reminiscent of our own designs. Dawkins has noted, for example, that 
the digital information in DNA bears an uncanny resemblance to 
computer software or machine code.° He explains that many aspects of 
livings systems “give the appearance of having been designed for a 
purpose.’ 

Nevertheless, neo-Darwinists regard that appearance of design as 
entirely illusory, as did Darwin himself, because they think that purely 
mindless, materialistic processes such as natural selection and random 
mutations can produce the intricate designed-like structures in living 
organisms. In this view, natural selection and random mutation mimic 


the powers of a designing intelligence without themselves being 
intelligently directed or guided in any way. 

That’s where the theory of intelligent design comes into play. 
Intelligent design challenges the idea that natural selection and random 
mutation (and other similarly undirected materialistic processes) can 
explain the most striking appearances of design in living organisms. 
Instead, it affirms that there are certain features of living systems that are 
best explained by the design of an actual intelligence—a conscious and 
rational agent, a mind—as opposed to a mindless, materialistic process. 
The theory of intelligent design does not reject “evolution” defined as 
“change over time” or even universal common ancestry, but it does 
dispute Darwin’s idea that the cause of major biological change and the 
appearance of design are wholly blind and undirected. 

Nor does the theory seek to insert into biology an extraneous 
religious concept. Intelligent design addresses a key scientific question 
that has long been part of evolutionary biology: Is design real or 
illusory? Indeed, part of what Darwin set out to explain was precisely 
the appearance of design. With current materialistic evolutionary 
theories now failing to explain many of the most striking appearances of 
design in the Cambrian animals, including the presence of digital 
information as well as other complex adaptations, the possibility 
emerges that these appearances of design may not be just appearances 
after all. The Darwinian formulation of evolutionary theory in 
opposition to the design hypothesis,’ coupled with the inability of neo- 
Darwinian and other materialistic theories to account for salient 
appearances of design, would seem logically to reopen the possibility of 
actual (as opposed to apparent) design in the history of animal life. 

Either life arose as the result of purely undirected material processes 
or a guiding or designing intelligence played a role. Advocates of 
intelligent design favor the latter option and argue that living organisms 
look designed because they really were designed. Design proponents 
argue that living systems exhibit telltale indicators of prior intelligent 
activity that justify this claim, indicators that make intelligent design 
scientifically detectable from the evidence of the living world. 


But that, for many evolutionary biologists, is precisely the rub. 
Because they think of intelligent design as a religiously based idea, they 
understand that people might want to affirm the intelligent design of life 
as part of their religious beliefs—but not as a consequence of scientific 
evidence. Indeed, most evolutionary biologists don’t see how the idea of 
intelligent design could contribute to a scientific explanation of life’s 
origins, nor do they see how intelligent design could ever be detected or 
inferred scientifically from evidence in nature. Exactly how would 
researchers justify such an inference? 


MY STORY 


When I left for my graduate studies in England in 1986, I was asking a 
similar set of questions. At that time, I wasn’t thinking about the 
scientific legitimacy of the intelligent design hypothesis as an 
explanation for the origin of animals. Instead, I wanted to know if 
intelligent design could help explain the origin of life itself. My 
questions eventually led me to learn about a distinctive method of 
historical scientific inquiry. That discovery led me to a method of 
reasoning that allows for the detection or inference of past causes, 
including intelligent causes. 

A year earlier, in 1985, I had met one of the first contemporary 
scientists to revive the idea that intelligent design might have played a 
causal role in the origins of life. Chemist Charles Thaxton (see Fig. 17.1) 
had recently published a book, The Mystery of Lifes Origin. His 
coauthors were polymer scientist and engineer Walter Bradley and 
geochemist Roger Olsen. Their book received acclaim as a 
groundbreaking critique of current theories of chemical evolution. They 
showed that attempts to explain the origin of the first living cell from 
simpler nonliving chemicals had failed and that these theories had 
specifically failed to explain the origin of the information necessary to 
produce the first life. 

But it was in the book’s epilogue that the three scientists proposed a 
radical alternative. There they suggested that the information-bearing 


properties of DNA might point to the activity of a designing intelligence 
—to the work of a mind, or an “intelligent cause” as they put it. 
Drawing on the analysis of the British-Hungarian physical chemist 
Michael Polanyi, they argued that chemistry and physics alone could not 
produce the information in DNA any more than ink and paper alone 
could produce the information in a book. Instead, they argued that our 
uniform experience suggests a cause-and-effect relationship between 
intelligent activity and the production of information.’ 





FIGURE 17.1 
Charles Thaxton. Courtesy Charles Thaxton. 


At the time the book appeared, I was working as a geophysicist for an 
oil company in Dallas where Thaxton happened to live. I met him at a 
scientific conference and became intrigued by his work. Over the next 
year, I began dropping by his office to discuss his book and the radical 
idea he was developing about DNA. 

The first part of Thaxton’s argument made sense to me. Experience 
does indeed seem to affirm that (specified or functional) information 
typically arises from the activity of intelligent agents, from minds as 
opposed to mindless, material processes. When a “tweet” appears on 
your smart phone’s Twitter feed (if you’re into that kind of thing), it 
clearly originated first in the mind of a person who created a Twitter 
account, scripted the “tweet,” and then sent it out across the Internet. 
Information does arise from minds. 

But Thaxton went further. He acknowledged that most branches of 
science didn’t consider intelligent activity as an explanation because, he 
thought, intelligent agents don’t usually generate repeatable or 
predictable phenomena and because they are difficult to study under 
controlled laboratory conditions. Nevertheless, Thaxton argued that 
scientists might propose an intelligent cause as a positive scientific 
explanation for some events in the past, as part of a special mode of 
scientific inquiry he called origins science. He noted that scientific 
disciplines such as archaeology, evolutionary biology, cosmology, and 
paleontology often infer the occurrence of singular, nonrepeatable events 
and that the methods used to make such inferences could help scientists 
identify positive indicators of intelligent causes in the past as well. 

Here I wasn’t initially so sure. Thaxton’s ideas about a distinctive 
method of science concerned with origins, or at least with the past 
generally, seemed intuitively plausible. After all, evolutionary biologists 
and paleontologists do seem to use a method of investigation different 
from that employed by laboratory chemists. Nevertheless, I wasn’t 
exactly sure what those methods were, how they were different from 


those used in other sciences, and whether using them in any way 
justified considering intelligent design as a scientific hypothesis. 

So the next year when I left Dallas, Texas, for Cambridge, England, 
to pursue my studies in the history and philosophy of science, I had a lot 
on my mind. Is there a distinctive method of historical scientific inquiry? 
If so, does that method of reasoning and investigation justify a scientific 
reformulation of the design hypothesis? In particular, does the intuitive 
connection between information and the prior activity of a designing 
intelligence justify a positive (historical) scientific inference to 
intelligent design? Does it make intelligent design detectable? 


HISTORICAL SCIENTIFIC METHOD AND THE DESIGN 
HYPOTHESIS 


In my research, I discovered that historical scientists often do make 
inferences with a distinctive logical form. This type of inference is 
known technically as an abductive inference.'!° During the nineteenth 
century, American logician C. S. Peirce characterized this mode of 
reasoning and distinguished it from two better-known forms, inductive 
and deductive reasoning. He noted that in inductive reasoning, general 
rules are inferred from particular facts, whereas in deductive reasoning, 
general rules are applied to particular facts in order to deduce specific 
outcomes. In abductive reasoning, however, inferences are often made 
about past events or causes based on present clues or facts.!! 

To see the difference between these three types of inference, consider 
the following argument forms: 

Inductive argument: 


A, 1s B. 
A, is B. 
A; is B. 
Aq is B. 
A, 1s B. 


All A’s are B. 
Deductive argument: 


MAJOR PREMISE: If A has occurred, then B will follow as a matter 
of course. 

MINOR PREMISE: A has occurred. 

CONCLUSION: Hence, B will follow as well. 


Abductive argument: 


MAJOR PREMISE: If A occurs, then B would be expected as a matter of 
course. 

MINOR PREMISE: The surprising fact B is observed. 

CONCLUSION: Hence, there is reason to suspect that A has occurred. 


Note the difference between deductive and abductive forms of 
inference. In deduction, the minor premise affirms the antecedent 
variable (“A”), while the conclusion deduces the consequent variable 
(“B”), an anticipated outcome. In this sense, deductive inferences look 
forward to something that will happen in the future. A classic illustration 
of deductive reasoning has this character: 


MAJOR PREMISE: All men are mortal. 
MINOR PREMISE: Socrates is a man. 
CONCLUSION: Therefore, Socrates is a mortal (i.e., he will die). 


In an abductive argument, the minor premise affirms the consequent 
variable (“B”) and its conclusion infers the antecedent variable (“A”)— 
the variable referring to something that went before, either logically or 
temporally. Abductive reasoning, thus, often affirms a past occurrence. 
For this reason, forensic or historical scientists such as geologists, 
paleontologists, archaeologists, and evolutionary biologists often use 
abductive reasoning to infer past conditions or causes from present 


clues. As Stephen Jay Gould notes, historical scientists typically “infer 
history from its results.” 1? 
For example, a geologist might reason as follows: 


MAJOR PREMISE: If a mudslide occurred, we would expect to find 
felled trees. 

MINOR PREMISE: We find evidence of felled trees. 

CONCLUSION: Therefore, we have reason to think that a mudslide may 
have occurred. 


In the deductive form, if the premises are true, the conclusion follows 
with certainty. The logic of the abductive arguments is different, 
however. Abductive arguments do not produce certainty, but instead 
merely plausibility or possibility. To see why, consider the following 
variation of the preceding abductive argument: 


MAJOR PREMISE: If a mudslide occurred, we would expect to find 
felled trees. 

MINOR PREMISE: We find felled trees. 

CONCLUSION: Therefore, a mudslide occurred. 


or symbolically: 


MAJOR PREMISE: If MS, then FT. 
MINOR PREMISE: FT. 
CONCLUSION: Therefore, MS. 


Notice that unlike the first version of the abductive argument in 
which the conclusion was stated tentatively (“We have reason to think 
that a mudslide may have occurred”), in this version the conclusion is 
affirmed definitively (“A mudslide occurred’). Obviously, this latter 
form of argument has a problem. It does not follow that, because the 
trees have fallen, a mudslide necessarily occurred. The trees may have 
fallen for some other reason. A hurricane may have blown them down; 


perhaps an ice storm occurred and the trees fell under the weight of 
accumulating ice; or loggers may have cut them down. In logic, 
affirming the consequent variable of a minor premise (with certainty) 
constitutes a formal fallacy—a fallacy that derives from the failure to 
acknowledge that more than one cause (or antecedent) might produce 
the same evidence (or consequent). 

Even so, the presence of downed timber might indicate that a 
mudslide has occurred. Thus, amending the above argument to conclude: 
“We have reason to think that a mudslide may have occurred” does not 
commit a fallacy. Even if we may not affirm the consequent with 
certainty, we may affirm it as a possibility. This is precisely what 
abductive reasoning does. It provides a reason for considering that a 
hypothesis—and often a hypothesis about the past—might be true, even 
if one cannot affirm the hypothesis (or conclusion) with certainty. 1° 


THE METHOD OF MULTIPLE COMPETING HYPOTHESES 


To address this limitation in abductive reasoning and to make it possible 
to strengthen inferences about the past, the nineteenth-century geologist 
Thomas Chamberlain developed a form of reasoning he called “the 
method of multiple working hypotheses.”!* Historical and forensic 
scientists employ this method when more than one cause or hypothesis 
can account for the same evidence. They use it to adjudicate between 
competing hypotheses by comparing them to see which one best 
explains not just one piece of evidence but, usually, a wider class of 
relevant facts. 

For example, consider how this method of reasoning was used to 
establish the hypothesis of continental drift as the best explanation for a 
wide range of geological observations. During the early 1900s, a 
German geologist and meteorologist named Alfred Wegener became 
fascinated with the way the African and South American continents fit 
together on the map like pieces of a jigsaw puzzle.!° He proposed that 
the continents had once been fused together as a single giant continent 
that he called “Pangea,” which later separated and drifted apart. 


Initially, many geologists ridiculed Wegener’s idea. They thought that 
—given the vast distances separating the continents—the matching 
shapes were most likely just a coincidence. Wegener’s critics dismissed 
his theory of continental drift as “delirious ravings,” “Germanic pseudo- 
science,” or a “fairy tale.”! But Wegener cited other evidence that he 
thought continental drift could explain that the coincidence hypothesis 
could not. He noted that fossil forms discovered on the east coast of 
South America matched those on the west coast of Africa in 
corresponding places and sedimentary strata. This fact seemed too 
coincidental to him to be explained away by chance alone. Nevertheless, 
other geologists attempted to explain matching fossil forms an ocean 
apart not as the result of the movement of the continents, but instead as 
the result of the migration of flora and fauna—either across oceans or 
over ancient land bridges.!’ This introduced a third hypothesis into the 
mix, one that, in conjunction with the coincidence hypothesis, could 
explain each of the same facts that Wegener’s hypothesis could. 

Later, however, an additional set of facts came to light—one that 
helped scientists to decide between the competing hypotheses. During 
World War II, the United States Navy surveyed the seafloor topography 
and measured the earth’s magnetic field across the oceans. These 
magnetic surveys showed parallel stripes of magnetized rock, each with 
the same polarity on either side of mountain ridges running down the 
middle of the ocean floor at equal distances from the mid-oceanic 
mountain ranges.!® Geologists also learned that magma was continually 
seeping out at the middle of these mid-oceanic mountain ranges. They 
discovered that as the magma cools, it “acquires” a characteristic 
magnetic signature reflecting the polarity of the earth’s magnetic field at 
that location at the time of its cooling. When ships towing sensitive 
magnetometers measured this “remanent magnetization,” scientists 
learned that the magnetization of the seafloor alternated between 
sections of “normal” and “reverse” polarity as the magnetometer was 
towed away from a mid-ocean ridge in each direction. This led to the 
discovery of a famous symmetrical “piano key” pattern on each side of a 
mid-ocean ridge, seen in Figure 17.2. 


To explain this symmetrical pattern of alternating magnetism, 
geologists proposed that the magnetic stripes were formed as the result 
of the seafloor spreading away from the mid-ocean ridge as magma was 
extruded and cooled in the presence of earth’s changing magnetic field 
—in other words, that the continents were literally drifting apart. This 
hypothesis not only explained the symmetrical pattern of magnetic 
stripes but also other relevant evidence. Although the other hypotheses 
could explain (or explain away) the fit of the continents and/or the 
similar pattern of fossilization across the oceans, only continental drift 
(driven by seafloor spreading) could explain the magnetic seafloor 
stripes and these other pieces of evidence. Consequently, as the result of 
its superior explanatory power, a decisive case for continental drift was 
soon established, strengthening a merely plausible abductive inference 
about the past movement of the continents by showing that this inference 
provided the best (and only adequate) explanation of all the relevant 
facts.!° 
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FIGURE 17.2 

This diagram shows the symmetrical pattern of alternating magnetic 
stripes of either “normal” or “reversed” polarity on either side of a mid- 
ocean ridge. Because this “piano key” pattern could only be explained 
by plate tectonics and seafloor spreading, it contributed to the 
widespread acceptance of those theories within contemporary geology. 


Contemporary philosophers of science such as Peter Lipton have 
called this method of reasoning “inference to the best explanation.”° 
Scientists often use this method when trying to explain the origin of an 
event or structure from the past. They compare various hypotheses to see 
which would, if true, best explain it.2! They then provisionally affirm the 
hypothesis that best explains the data as the one that is most likely to be 
true. 

Obviously, saying, “The best explanation is the one that best explains 
the facts or that best explains the most facts,” begs an important 
question. What does it mean to explain something well or best? 

As it happens, historical scientists have developed criteria for 
deciding which explanation, among a group of competing possible 
hypotheses, provides the best explanation for some event in the remote 
past. The most important of these criteria is “causal adequacy.” As a 
condition of formulating a successful explanation, historical scientists 
must identify causes that are known to have the power to produce the 
kind of effect, feature, or event in question. In seeking to identify such 
causes, historical scientists evaluate hypotheses in light of their present 
knowledge of cause and effect. Causes that are known to produce the 
effect in question (or are thought capable of doing so) are judged to be 
better candidates than those that are not. For instance, a volcanic 
eruption provides a better explanation for an ash layer in the earth than 
an earthquake or a flood, because eruptions have been observed to 
produce ash layers, whereas earthquakes and floods have not. 

One of the first historical scientists to develop the criterion of causal 
adequacy was the geologist Charles Lyell (1797—1875), who in turn 
influenced Charles Darwin. Darwin read Lyell’s magnum opus, The 
Principles of Geology, during his voyage on the HMS Beagle and 
employed Lyell’s principles of reasoning in On the Origin of Species. 
The subtitle of Lyell’s book summarized his central methodological 
principle: Being an Attempt to Explain the Former Changes of the 
Earth's Surface, by Reference to Causes Now in Operation (1830—1833). 
Lyell argued that when scientists seek to explain events in the past, they 
should not invoke some unknown type of cause, the effects of which we 


have not observed. Instead, they should cite causes that are known from 
our uniform experience to have the power to produce the effect in 
seston 22 Historical scientists should cite presently acting causes, that 
is, “causes now in operation.” This was the idea behind his 
uniformitarian method and its famous dictum: “The present is the key to 
the past.” According to Lyell, our present experience of cause and effect 
should guide our reasoning about the causes of past events. Darwin 
adopted this methodological principle as he sought to demonstrate that 
natural selection qualified as a vera causa, that is, a true, known, or 
actual cause of significant biological change.” In other words, he sought 
to show that natural selection was “causally adequate” to produce the 
effects he was trying to explain. 


THE ONLY KNOWN CAUSE 


Both philosophers of science and leading historical scientists have 
emphasized causal adequacy as the key criterion by which competing 
hypotheses are judged. But philosophers of science have insisted that 
assessments of explanatory power lead to conclusive inferences only 
when there is just one known cause for the effect or evidence?” (see Fig. 
17.3) in question. If there are many causes that can produce the same 
effect, then the presence of the effect does not definitively establish the 
cause. When scientists know of only one cause for a given effect, 
however, they can infer that cause and yet avoid the fallacy of affirming 
the consequent—the error of ignoring other possible causes with the 
power to produce the same effect.?> In that case, they can infer or detect 
a uniquely plausible past cause from the clues that are left behind. 

This can happen in one of two ways. First, historical scientists might 
focus their investigation on a single fact (in isolation) for which only one 
cause happens to be known. In such a case, they can quickly and 
decisively infer the cause from the effect alone—without risk of 
affirming the consequent, because no other known cause produces the 
same effect. For example, because a volcanic eruption is the only known 


cause of a volcanic ash layer, the presence of such a layer at an 
archeological site strongly indicates the prior eruption of a volcano. 





FIGURE 17.3 

Schematic of the logical problem of retrodiction. Whether it is possible 
to reconstruct the past definitively or not depends upon whether there is 
a single cause or condition that gives rise to a present state or whether 
there are many possible past causes or conditions that give rise to a 
given present state. The diagram on the left portrays an information- 
destroying situation in which many past causes (or conditions) 
correspond to a given present state. The diagram on the right portrays an 
information-preserving situation in which only one past cause (or 
condition) corresponds to a present state. Adapted from Sober, 
Reconstructing the Past, 4. 


In other cases where historical scientists encounter evidence for 
which there are many known causes, they will often broaden their 
investigation beyond an initial fact or set of facts. In such cases, they 
will use the strategy described above (as part of the method of multiple 
competing hypotheses), by looking for additional evidence until they 
find a piece for which there is only one known cause. They can then 
compare the explanatory power of the competing hypotheses. Using this 
strategy, historical scientists will choose the proposed cause with the 
demonstrated power to produce all the relevant evidence, including the 
new fact or piece of evidence for which there is only one known cause. 
For example, the discovery of the symmetrical pattern of ocean-floor 


magnetism on opposite sides of a mid-oceanic ridge allowed for a 
comparison of the explanatory power of the three hypotheses under 
consideration, leaving only seafloor spreading as a causally adequate 
explanation of all the relevant facts. 

Such an approach often allows historical scientists to pick out a piece 
of evidence (from some combination of effects) for which there is only 
one known (or theoretically plausible) cause, thus making it possible to 
establish a past cause decisively. Though this strategy involves looking 
at a wider class of facts than the first strategy, the logical status of the 
inferences involved is the same. In each case, the presence of a fact 
(either standing on its own or in combination with other facts) for which 
only one cause is known allows historical scientists to make a definitive 
inference about the causal history in question without committing the 
fallacy of affirming the consequent. Logically, if a postulated cause is 
known to be a necessary condition or cause of a given event or effect, 
then historical scientists can validly infer that condition or cause from 
the presence of the effect. If it’s true that where there is smoke there is 
always first fire, then the presence of smoke wafting up over a distant 
mountain range decisively indicates the prior presence of a fire on the 
other side of the ridge. 


HISTORICAL INFERENCE AND INTELLIGENT DESIGN 


What does all this have to do with the Cambrian explosion? 

Quite a lot. In my investigation of the historical scientific method, I 
found that whether they always realize it or not, historical scientists 
typically use the method of inference to the best explanation. They make 
abductive inferences about past causes from present clues, evidence, or 
effects. This later suggested to me that if there were features of the 
Cambrian explosion or the Cambrian animals that would be “expected as 
a matter of course” if an intelligent designer had played a role in that 
event, then it was at least possible to formulate the hypothesis of 
intelligent design as a historical (abductive) scientific inference. An 


advocate of intelligent design could reason in a standard historical 
scientific way: 


MAJOR PREMISE: If intelligent design played a role in the Cambrian 
explosion, then feature (X) known to be produced by intelligent 
activity would be expected as a matter of course. 

MINOR PREMISE: Feature (X) is observed in the Cambrian explosion of 
animal life. 

CONCLUSION: Hence, there is reason to suspect that an intelligent 
cause played a role in the Cambrian explosion. 


Of course, a historical scientist would only be justified in making 
such an abductive inference to the past activity of an intelligent cause if 
“feature X” is evident in the Cambrian explosion and if intelligent 
design is known to produce “feature X.” Moreover, just because the 
Cambrian explosion may exhibit some feature or features for which 
intelligent design is a known cause does not mean that intelligent design 
was necessarily the actual cause (or the best explanation) of those 
features. Only if the Cambrian event and animals exhibit features for 
which intelligent design is the only known cause may a historical 
scientist make a decisive inference to a past intelligent cause. 

We are left with two crucial questions. Are there in fact such features 
present in the record of the Cambrian explosion or in the animals that 
arise in it—features that are known from our experience to be produced 
by intelligent causes such that they would justify making a tentative 
abductive inference to intelligent design? Are there also perhaps features 
of the Cambrian event that are known from our experience to be 
produced by intelligent causes, and only intelligent causes, justifying a 
more definitive inference to past intelligent activity as the best 
explanation for the relevant evidence? Might “the butler” have done it 
after all? 


18 


Signs of Design in the Cambrian 
Explosion 


Well-crafted mystery novels, like real-world crime investigations, unfold 
with a distinctive logic. There is a death to be explained and, at the start, 
an indefinitely large universe of possible causes. That universe can be 
made smaller, narrowing to the one true cause, as more and more clues 
come to light. Those clues typically come in two forms: positive 
evidence, or indicators of what likely happened (e.g., .38 caliber shell 
casings on the ground and bullet wounds in a body) and negative 
evidence, or indicators of what could not have happened. 

Let’s say that the local sheriff who discovered the body of the estate 
owner (from my illustration in the previous chapter), did so as he was 
making his rounds on a dirt road that makes a close approach to the 
beach at the end of the estate where the owner died. And let’s say that, as 
a result, the sheriff just happened to find the body soon after the murder 
had taken place. Let’s suppose, further, that the sheriff had the presence 
of mind to immediately measure the temperature of the body only to find 
that the victim was still warm, indeed almost as warm as a living person. 
Clearly, in this situation, the sheriff would conclude that the victim had 
just died. At that point in the investigation, a physical regularity would 
govern the sheriff’s reasoning, one that tells a lot about who didnt 
commit the murder. Following death, the human body cools to the 
surrounding temperature at a known rate. So, making allowances for 
vehicular transport, whoever committed the murder could not have gone 


beyond a certain distance from the remote estate at the time the body 
was found. 

These facts would immediately provide a rock-solid alibi to the vast 
majority of humanity, anyone located safely outside that radius when the 
body was discovered. Of course, calling this information a negative clue 
is really only a convention of naming. “Negative” and “positive” refer to 
how we conceive of the implications of a fact, but not to the fact itself: 
the evidence, after all, is what it is. Even so, facts both exclude and 
allow competing possible hypotheses. As they accumulate, they 
typically paint a picture, a profile, of the actual cause of the event in 
need of explanation. Thus, when we say “the body temperature of the 
deceased rules out the 7 billion people who were well beyond the radius 
set by the cooling rate,” we could equally well have said, “the body 
temperature implicates some person within 30 miles of the estate when 
the sheriff arrived,” a population of possible suspects much smaller than 
when we started. 

As I have described the many attempts to explain the scientific 
enigma motivating this book, that mystery has, in one sense, 
progressively deepened. As more and more attempts to explain the 
Cambrian explosion of animal life have failed, the evidence that these 
various competing theories fail to explain may be considered a set of 
negative clues—evidence that effectively precludes certain possible 
causes or explanations. I’ve already explained why the received version 
of evolutionary theory, neo-Darwinism, fails to account for the explosion 
of information and form in the Cambrian period. I’ve also examined 
more recent evolutionary theories and shown why they too fail to 
explain key aspects of the evidence. To this point, then, much of the 
evidence has returned a negative verdict. It has told us a lot about what, 
in all probability, did not cause the Cambrian explosion. But, as in our 
hypothetical murder case, an accumulating body of evidence that makes 
one set of explanations less and less plausible may also begin to paint a 
picture of an alternative cause and the true explanation. 


PROFILE OF THE SUSPECT 


Long before detectives know the actual identity of a suspect, they will 
often compose a profile of the person they are seeking. One leading 
paleontologist has used this strategy to begin to draw a bead on the cause 
responsible for the Cambrian explosion. Douglas Erwin has dedicated 
his career to solving the problem of the origin of animal body plans (see 
Fig. 18.1). Trained at the University of California by James Valentine, 
another Cambrian expert, Erwin has worked closely over the past decade 
with Eric Davidson, whom we first met in Chapter 13, trying to 
determine what happened to cause dozens of novel body plans to appear 
—and appear rapidly—in the Cambrian period. 

Both Erwin and Davidson have now ruled out standard neo- 
Darwinian theory—vehemently in Davidson’s case. He says that the 
standard theory “gives rise to lethal errors.”! But Erwin and Davidson 
go further. They have assembled what is, in essence, a clue sheet—a list 
of key evidences that must be explained. Using that list, they have begun 
to sketch, at least in outline, a profile of the cause behind the Cambrian 
explosion. 

On the positive side of the ledger, they conclude that this cause must 
have several attributes in order to explain key facts about the fossil 
record as well as what it takes to build animals. In particular, the cause 
must be capable of generating a top-down pattern of appearance; it must 
be capable of generating new biological form relatively rapidly; and it 
must be capable of constructing, not merely modifying, complex 
integrated genetic circuits (specifically, the developmental gene 
regulatory networks discussed in Chapter 13). 

On the negative side, Davidson and Erwin rule out both observed 
microevolutionary processes and postulated macroevolutionary 
mechanisms (such as punctuated equilibrium and species selection) as 
explanations for the origin of the key features of the Cambrian 
explosion. They insist that the requirements for constructing animal 
body plans de novo “cannot be accommodated by microevolutionary [ or] 
macroevolutionary theory.” 





FIGURE 18.1 
Douglas Erwin. Courtesy UPHOTO/Cornell University. 


In Chapter 13, I discussed their reason for coming to this conclusion: 
developmental gene regulatory networks, once in place, cannot be 
perturbed (or mutated) without “catastrophic”? consequences to the 
developing animal. Thus, fundamentally new gene regulatory networks 
(dGRNs) cannot evolve gradually from preexisting dGRNs, if those 
evolutionary changes require perturbing the deepest nodes of the earlier 
dGRNs. Yet building new dGRNs capable of producing new animals 
requires precisely such fundamental alterations in preexisting dGRNs. 
But then how would new regulatory networks ever arise? Davidson and 
Erwin insist that no current theory of evolution explains the origin of 
these systems. Thus, they conclude that the cause of the Cambrian 
explosion is not described by any currently proposed theory of micro- or 
macroevolution. 

In saying this, Erwin emphasizes the uniqueness of the innovations 
that occurred in the Cambrian explosion. He explains: “Unlike later 
events, the most significant developmental events of the Cambrian 
radiation involved the proliferation of cell types, developmental 
hierarchies and epigenetic cascades.”* Consequently, he concludes, “The 
crucial difference between the developmental events of the Cambrian 
and subsequent events is that the former involved the establishment of 
these developmental patterns, not their modification.”° For this reason, 
Erwin denies that the central event of the Cambrian explosion—the 


origin of novel body plans—has any parallel to currently observed 
biological processes. Rather, he insists that the events of the past were 
fundamentally different—that profound asymmetries exist between 
evolution then, and evolution now. Thus, he amplifies his denial of the 
sufficiency of current evolutionary theory by adding one additional 
attribute, albeit a negative one, to his portrait of “the suspect”: the cause 
responsible for generating the new animal forms, whatever it was, must 
have been unlike any observed biological process operating in actual 
living populations today. 


PROFILING A CAUSE 


All of this raises an obvious question. Could the negative clues that 
increasingly disconfirm materialistic evolutionary theories also be 
positive indicators of a different kind of cause—perhaps even an 
intelligent cause? 

By sketching the profile of the kind of cause needed to explain the 
origin of animal life, Davidson, Erwin, and many other evolutionary 
biologists may have, inadvertently, rendered the idea of intelligent 
design a bit less inconceivable. To see why, let’s quickly review Erwin 
and Davidson’s profile of the suspect. They have concluded that the 
cause of the origin of the new animal forms in the Cambrian explosion 
must be capable of: 


e generating new form rapidly 
e generating a top-down pattern of appearance 
e constructing, not merely modifying, complex integrated circuits 


They have also concluded that this cause 1s: 


e not described by any currently proposed theory of micro- or 
macroevolution 

e unlike any observed biological process operating in actual living 
populations today 


Erwin and Davidson, no friends of intelligent design, have sketched a 
partial profile of an adequate cause as befits their particular interest in 
the importance of gene regulatory networks (Davidson) and fossil 
discontinuity (Erwin). But other evolutionary biologists have contributed 
to this picture as well. Simon Conway Morris marvels at “the uncanny 
ability of evolution to navigate to the appropriate solution through 
immense ‘hyperspaces’ of biological possibility.”’ As a result, he argues 
that evolution may in some way be “channeled” toward propitious 
functional and/or structural end points—without specifying any known 
evolutionary mechanism that can so direct evolution to such end points.® 
James Shapiro proposes a mechanism of evolutionary change that relies 
on preprogrammed adaptive capacity—without explaining where such 
preprogramming comes from.” Several of the new evolutionary theories 
discussed in the previous chapters presuppose, but do not explain, the 
existence of both genetic and epigenetic forms of information, 
highlighting the need for a cause capable of generating such information 
in the first place. 

Erwin and Davidson have made a bold start with their clue list ruling 
out neo-Darwinism. But the evidence explored in these pages suggests 
additional attributes that need to be added to their profile of the actual 
cause of the Cambrian explosion. Our previous investigations have 
suggested that building an animal requires specified or functional 
information and that any explanation for the origin of the Cambrian 
animals must identify a cause capable of generating: 


e digital information 

e structural (epigenetic) information 

e functionally integrated and hierarchically organized layers of 
information 


Still, do any or even all of these clues add up to a reason for 
considering that an alternative kind of cause—a designing intelligence— 
might have played a role in the origin of animal life? 


They do. As it turns out, each of the features of the Cambrian animals 
and the Cambrian fossil record that constitute negative clues—clues that 
render neo-Darwinism and other materialistic theories inadequate as 
causal explanations—also happen to be features of systems known from 
experience to have arisen as the result of intelligent activity. In other 
words, standard materialistic evolutionary theories have failed to 
identify an adequate mechanism or cause for precisely those attributes of 
living forms that we know from experience only intelligence—conscious 
rational activity—is capable of producing. That suggests, in accord with 
the method of historical scientific reasoning elucidated in the previous 
chapter, the possibility of making a strong historical inference to 
intelligent design as the best explanation for the origin of those 
attributes. 

Let’s have a look at each of these features of the Cambrian event, 
starting with key features of the Cambrian animals themselves, to see 
how they might point to the past activity of a designing intelligence, 
thereby making intelligent design scientifically detectable. 


THE CAMBRIAN INFORMATION EXPLOSION 


We have seen that building a Cambrian (or any other) animal would 
require vast new, functionally specified digital information. Moreover, 
the presence of such digitally encoded information in DNA presents, at 
least, a striking appearance of design in all living organisms. As Richard 
Dawkins observes, for example, “The machine code of the genes is 
uncannily computer-like.”!° Similarly, biotechnology pioneer Leroy 
Hood refers to the information stored in DNA as “digital code” and 
describes it in terms reminiscent of computer software.'! And as we 
have seen, Microsoft’s Bill Gates notes: “DNA is like a computer 
program but far, far more advanced than any software ever created.” 1? 
Yet we’ve also seen that neither neo-Darwinism nor any other 
materialistic evolutionary model or mechanism explains the origin of the 
genetic information (the digital code) necessary to produce the Cambrian 
animals or even the simplest structural innovations that they exhibit. 


Could this—from a materialistic point of view—unexplained appearance 
of design point instead to actual intelligent design? 

I think it does. But to explain why, I need to tell a bit more about the 
“evolution” of my own thinking on the matter. 

After learning about how historical scientists make inferences about 
the causes of events in the remote past, I first applied these methods of 
reasoning to the question of the origin of the information necessary to 
produce the first living cell. My book Signature in the Cell used the 
method of multiple competing hypotheses (or inference to the best 
explanation) to evaluate the “causal adequacy” of proposed explanations 
for the ultimate origin of biological information. I showed that chemical 
evolutionary models (whether based upon chance, physical-chemical 
necessity, or the combination of the two) failed to identify a cause 
capable of producing the digital information in DNA and RNA. Yet we 
do know of a cause that has demonstrated the causal power to produce 
digital code. That cause is intelligent agency. Since intelligent agency is 
the only cause known to be capable of generating information (at least 
starting from nonliving chemicals), intelligent design offers the best 
explanation for the origin of the information necessary to produce the 
first organism. 

The case for intelligent design in Signature was carefully limited as a 
challenge to chemical evolution. Many evolutionary biologists 
acknowledge that chemical evolutionary theory has failed to account for 
the origin of the first life. Many cite its inability to account for the origin 
of biological information as one of the main reasons for that failure. 
Moreover, because they do not think that natural selection could have 
played a significant role in evolution until after the first self-replicating 
organisms had arisen, most evolutionary biologists also think that 
explaining the origin of information in a prebiotic context is much more 
difficult than explaining the origin of new information in already living 
organisms. 

For this reason, in Signature I did not try to argue that intelligent 
design might help explain the origin of the information necessary to 
account for the origin of new animals from simpler preexisting forms of 


life. That would have required a separate demonstration showing the 
inadequacy of natural selection and mutation as a mechanism for 
generating new genetic information in already living organisms. This 
book—in Chapters 9—14—has provided that demonstration. These 
chapters show how neo-Darwinism fails to explain the origin of genetic 
information—at least, in amounts necessary to build a new protein fold. 
Chapters 15 and 16 showed, in addition, that the other main materialistic 
evolutionary theories also fail to account for the information necessary 
to build new forms of animal life. These theories presuppose, rather than 
explain, the origin of the information necessary for structural innovation 
in the history of life. And since the Cambrian explosion of animal life is 
an explosion of information and structural innovation, that raises a 
question. Is it possible that this increase of biological information not 
only represents evidence against materialistic theories of biological 
evolution, but also positive evidence for intelligent design? 


A Cause Now in Operation 


It does. Intelligent agents, due to their rationality and consciousness, 
have demonstrated the power to produce specified or functional 
information in the form of linear sequence-specific arrangements of 
characters. Digital and alphabetic forms of information routinely arise 
from intelligent agents. A computer user who traces the information on a 
screen back to its source invariably comes to a mind—a software 
engineer or programmer. The information in a book or inscription 
ultimately derives from a writer or scribe. Our experience-based 
knowledge of information flow confirms that systems with large 
amounts of specified or functional information invariably originate from 
an intelligent source. The generation of functional information is 
“habitually associated with conscious activity.”!? Our uniform 
experience confirms this obvious truth. 

It also suggests, therefore, that intelligent design meets the key 
“causal adequacy” requirement of a good historical scientific 
explanation. Certainly, intelligence is a “cause now in operation” 


capable of generating functional or specified information in a digital 
form. As I write this, my mind is generating specified information. 
Intelligent agents generate information in the form of software code, 
ancient inscriptions, books, encrypted military codes, and much else. 
And since we know of no “presently acting” materialistic cause that also 
generates large amounts!* of specified information (especially in a 
digital or alphabetic form), only intelligent design meets the causal 
adequacy requirement of a historical scientific explanation. In other 
words, our uniform experience of cause and effect shows that intelligent 
design is the only known cause of the origin of large amounts of 
functionally specified digital information. It follows that the great 
infusion of such information in the Cambrian explosion points decisively 
to an intelligent cause. 

Intelligent design stands alone as an explanation for the origin of 
genetic information for another reason: purposive agents have just those 
necessary powers that natural selection lacks as a condition of its causal 
adequacy. We have seen that natural selection lacks the ability to 
generate novel information precisely because it can only act after new 
functional information has arisen. Natural selection can favor new 
proteins and genes, but only after they perform some function 
(influencing reproductive output). The job of generating new functional 
genes, proteins, and systems of proteins therefore falls entirely to 
random mutations. Yet without functional criteria to guide a search 
through the space of possible sequences, random variation is 
probabilistically doomed. What is needed is not just a source of variation 
(i.e., the freedom to search a space of possibilities) or a mode of 
selection that can operate after the fact of a successful search, but instead 
a means of selection that (a) operates during a search—before success— 
and that (b) is guided by information about or knowledge of a functional 
target. 

Demonstration of this requirement has come from an unlikely 
quarter: genetic algorithms. Genetic algorithms are programs that 
allegedly simulate the creative power of mutation and selection. Richard 
Dawkins, Bernd-Olaf Kiippers, and others have developed computer 


programs that putatively simulate the production of genetic information 
by mutation and natural selection.!° Yet these programs succeed only by 
the illicit expedient of providing the computer with a “target sequence” 
and then treating proximity to future function (1.e., the target sequence), 
not actual present function, as a selection criterion. As mathematician 
David Berlinski shows, genetic algorithms need something akin to a 
“forward-looking memory” in order to succeed.!® Yet such foresighted 
selection has no analogue in nature. In biology, where differential 
survival depends upon maintaining function, natural selection cannot 
occur before new functional sequences arise. Natural selection lacks 
foresight; the process, as evolutionary theorists Rodin and Szathmary 
note, works strictly “ ‘in the present moment,’ right here and right now 
... lacking the foresight of potential future advantages.”!’ 

What natural selection lacks, intelligent design—purposive, goal- 
directed selection—provides. Rational agents can arrange both matter 
and symbols with distant goals in mind. They also routinely solve 
problems of combinatorial inflation. In using language, the human mind 
routinely “finds” or generates highly improbable linguistic sequences to 
convey an intended or preconceived idea. In the process of thought, 
functional objectives precede and constrain the selection of words, 
sounds, and symbols to generate functional (and meaningful) sequences 
from a vast ensemble of meaningless alternative possible combinations 
of sound or symbol.'® Similarly, the construction of complex 
technological objects and products, such as bridges, circuit boards, 
engines, and software, results from the application of goal-directed 
constraints.!” Indeed, in all functionally integrated complex systems 
where the cause is known by experience or observation, designing 
engineers or other intelligent agents applied constraints on the possible 
arrangements of matter to limit possibilities in order to produce 
improbable forms, sequences, or structures. Rational agents have 
repeatedly demonstrated the capacity to constrain possible outcomes to 
actualize improbable but initially unrealized future functions. Repeated 
experience affirms that intelligent agents (minds) uniquely possess such 
causal powers. 


Analysis of the problem of the origin of biological information, 
therefore, exposes a deficiency in the causal powers of natural selection 
and other undirected evolutionary mechanisms that corresponds 
precisely to powers that agents are uniquely known to possess. 
Intelligent agents have foresight. Such agents can determine or select 
functional goals before they are physically instantiated. They can devise 
or select material means to accomplish those ends from among an array 
of possibilities. They can then actualize those goals in accord with a 
preconceived design plan or set of functional requirements. Rational 
agents can constrain combinatorial space with distant information-rich 
outcomes in mind. The causal powers that natural selection lacks—by 
definition—are associated with the attributes of consciousness and 
rationality—with purposive intelligence. Thus, by invoking intelligent 
design to overcome a vast combinatorial search problem and to explain 
the origin of new specified information, contemporary advocates of 
intelligent design are not positing an arbitrary explanatory element 
unmotivated by a consideration of the evidence. Instead, we posit an 
entity possessing precisely the causal powers that a key feature of the 
Cambrian explosion—the explosive increase in specified information— 
requires as a condition of its production and explanation. 


INTEGRATED CIRCUITRY: DEVELOPMENTAL GENE 
REGULATORY NETWORKS 


Keep in mind, too, that animal forms have more than just genetic 
information. They also need tightly integrated networks of genes, 
proteins, and other molecules to regulate their development—in other 
words, they require developmental gene regulatory networks, the 
dGRNs that Eric Davidson has so meticulously mapped over the course 
of his career. Developing animals face two main challenges. First, they 
must produce different types of proteins and cells and, second, they must 
get those proteins and cells to the right place at the right time.?° 
Davidson has shown that embryos accomplish this task by relying on 
networks of regulatory DNA-binding proteins (called transcription 


factors) and their physical targets. These physical targets are typically 
sections of DNA (genes) that produce other proteins or RNA molecules, 
which in turn regulate the expression of still other genes. 

These interdependent networks of genes and gene products present a 
striking appearance of design. Davidson’s graphical depictions of these 
dGRNs look for all the world like wiring diagrams in an electrical 
engineering blueprint or a schematic of an integrated circuit, an uncanny 
resemblance Davidson himself has often noted. “What emerges, from 
the analysis of animal dGRNs,” he muses, “is almost astounding: a 
network of logic interactions programmed into the DNA sequence that 
amounts essentially to a hardwired biological computational device.”?! 
These molecules collectively form a tightly integrated network of 
signaling molecules that function as an integrated circuit. Integrated 
circuits in electronics are systems of individually functional components 
such as transistors, resistors, and capacitors that are connected together 
to perform an overarching function. Likewise, the functional 
components of dGRNs—the DNA-binding proteins, their DNA target 
sequences, and the other molecules that the binding proteins and target 
molecules produce and regulate—also form an integrated circuit, one 
that contributes to accomplishing the overall function of producing an 
adult animal form. 

Yet, as explained in Chapter 13, Davidson himself has made clear 
that the tight functional constraints under which these systems of 
molecules (the dGRNs) operate preclude their gradual alteration by the 
mutation and selection mechanism. For this reason, neo-Darwinism has 
failed to explain the origin of these systems of molecules and their 
functional integration. Like advocates of evolutionary developmental 
biology, Davidson himself favors a model of evolutionary change that 
envisions mutations generating large-scale developmental effects, thus 
perhaps bypassing nonfunctional intermediate circuits or systems. 
Nevertheless, neither proponents of “evo-devo,” nor proponents of other 
recently proposed materialistic theories of evolution, have identified a 
mutational mechanism capable of generating a dGRN or anything even 
remotely resembling a complex integrated circuit. Yet, in our experience, 


complex integrated circuits—and the functional integration of parts in 
complex systems generally—are known to be produced by intelligent 
agents—specifically, by engineers. Moreover, intelligence is the only 
known cause of such effects. Since developing animals employ a form of 
integrated circuitry, and certainly one manifesting a tightly and 
functionally integrated system of parts and subsystems, and since 
intelligence is the only known cause of these features, the necessary 
presence of these features in developing Cambrian animals would seem 
to indicate that intelligent agency played a role in their origin (see Fig. 
13.4). 


THE HIERARCHICAL ORGANIZATION OF GENETIC AND 
EPIGENETIC INFORMATION 


In addition to the information stored in individual genes and the 
information present in the integrated networks of genes and proteins in 
dGRNs, animal forms exemplify hierarchical arrangements or layers of 
information-rich molecules, systems, and structures. For example, 
developing embryos require epigenetic information in the form of 
specifically arranged (a) membrane targets and patterns, (b) cytoskeletal 
arrays, (c) ion channels, and (d) sugar molecules on the exterior of cells 
(the sugar code). As noted in Chapter 13, much of this epigenetic 
information resides in the structure of the maternal egg and is inherited 
directly from membrane to membrane independently of DNA. 

This three-dimensional structural information interacts with other 
information-rich molecules and systems of molecules to ensure the 
proper development of an animal. In particular, epigenetic information 
influences the proper positioning and thus the function of regulatory 
proteins (including DNA-binding proteins), messenger RNAs, and 
various membrane components. Epigenetic information also influences 
the function of developmental gene regulatory networks. Thus, 
information at a higher structural level in the maternal egg helps to 
determine the function of both whole networks of genes and proteins 
(dGRNs) and individual molecules (gene products) at a lower level 


within a developing animal. Genetic information is necessary to specify 
the arrangement of amino acids in a protein or bases in an RNA 
molecule. Similarly, dGRNs are necessary to specify the location and/or 
function of many gene products. And, in a similar way, epigenetic 
information is necessary to specify the location and determine the 
function of lower-level molecules and systems of molecules, including 
the dGRNs themselves. 

Furthermore, the role of epigenetic information provides just one of 
many examples of the hierarchical arrangement (or layering) of 
information-rich structures, systems, and molecules within animals. 
Indeed, at every level of the biological hierarchy, organisms require 
specified and highly improbable (information-rich) arrangements of 
lower-level constituents in order to maintain their form and function. 
Genes require specified arrangements of nucleotide bases; proteins 
require specified arrangements of amino acids; cell structures and cell 
types require specified arrangements of proteins or systems of proteins; 
tissues and organs require specific arrangements of specific cell types; 
and body plans require specialized arrangements of tissues and organs. 
Animal forms contain information-rich lower-level components (such as 
proteins and genes). But they also contain information-rich 
arrangements of those components (such as the arrangement of genes 
and gene products in dGRNs or proteins in cytoskeletal arrays or 
membrane targets). Finally, animals also exhibit information-rich 
arrangements of higher-level systems and structures (such as the 
arrangements of specific cell types, tissues, and organs that form specific 
body plans). 

The highly specified, tightly integrated, hierarchical arrangements of 
molecular components and systems within animal body plans also 
suggest intelligent design. This is, again, because of our experience with 
the features and systems that intelligent agents—and only intelligent 
agents—produce. Indeed, based on our experience, we know that 
intelligent human agents have the capacity to generate complex and 
functionally specified arrangements of matter—that is, to generate 
specified complexity or specified information. Further, human agents 


often design information-rich hierarchies, in which both individual 
modules and the arrangement of those modules exhibit complexity and 
specificity—specified information as defined in Chapter 8. Individual 
transistors, resistors, and capacitors in an integrated circuit exhibit 
considerable complexity and specificity of design. Yet at a higher level 
of organization, the specific arrangement and connection of these 
components within an integrated circuit requires additional information 
and reflects further design (see Fig. 14.2). 

Conscious and rational agents have, as part of their powers of 
purposive intelligence, the capacity to design information-rich parts and 
to organize those parts into functional information-rich systems and 
hierarchies. We know of no other causal entity or process that has this 
capacity. Clearly, we have good reason to doubt that mutation and 
selection, self-organizational processes, or any of the other undirected 
processes cited by other materialistic evolutionary theories, can do so. 
Thus, based upon our present experience of the causal powers of various 
entities and a careful assessment of the efficacy of various evolutionary 
mechanisms, we can infer intelligent design as the best explanation for 
the origin of the hierarchically organized layers of information needed to 
build the animal forms that arose in the Cambrian period. 


LOCATION, LOCATION, LOCATION 


There is another remarkable aspect of the hierarchical organization of 
information in animal forms. Many of the same genes and proteins play 
very different roles, depending upon the larger organismal and 
informational context in which they find themselves in different animal 
groups.” For example, the same gene (Pax-6 or its homolog, called 
eyeless), helps to regulate the development of the eyes of fruit flies 
(arthropods) and those of squid and mice (cephalopods and vertebrates, 
respectively). Yet arthropod eyes exemplify a completely different 
structure from vertebrate or cephalopod eyes. The fruit fly possesses a 
compound eye with hundreds of separate lenses (ommatidia), whereas 
both mice and squid employ a camera-type eye with a single lens and 


retinal surface. In addition, although the eyes of squid and mice 
resemble each other optically (single lens, large internal chamber, single 
retinal surface), they focus differently. They undergo completely 
different patterns of development and utilize different internal structures 
and nerve connections to the visual centers of the brain. Yet the Pax-6 
gene and its homologs play a key role in regulating the construction of 
all three of these different adult sensory structures. Moreover, 
evolutionary and developmental biologists have found that this pattern 
of “same genes, different anatomy” recurs throughout the bilaterian 
phyla, for features as fundamental as appendages, segmentation, the gut, 
heart, and sense organs (see Fig. 18.2).?° 

This pattern contradicts the expectations of textbook evolutionary 
theory. Neo-Darwinism predicts that disparate adult structures should be 
produced by different genes. This prediction follows directly from the 
neo-Darwinian assumption that all evolutionary (including anatomical) 
transformations begin with mutations in DNA sequences—mutations 
that are fixed in populations by natural selection, genetic drift, or other 
evolutionary processes. The arrow of causality flows one way from 
genes (DNA) to development to adult anatomy. Thus, if biologists 
observe different animal forms, it follows that they should expect that 
different genes will specify those forms during animal development. 
Given the profound differences between the fruit-fly compound eye and 
the vertebrate camera eye, neo-Darwinian theory would not predict that 
the “same” genes would be involved in building different eyes in 
arthropods and chordates.*4 
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FIGURE 18.2 
The same genes can be used in different animals to produce dramatically 
different structures, contradicting neo-Darwinian expectations. 


Many leading evolutionary theorists have acknowledged this 
problem. University of Wisconsin evo-devo researcher Sean B. Carroll 
has noted that the neo-Darwinian prediction of similar genes producing 
similar structures is “entirely incorrect.”? Stephen Jay Gould described 
the discovery of the polyfunctional role of similar genes as “explicitly 
unexpected” and “discombobulat[ing] the confident expectations of 
orthodox theory.””° 

The theory of intelligent design suggests a solution to the problem—a 
solution familiar to us from the construction and operation of our own 
artifacts. Figure 18.3 shows a general-purpose switching transistor. 
These electronic components can be used to help build many electronic 
systems, from a computer to a microwave oven to a radio. And the exact 
functional role that the transistor will play will be governed by the 
system in which it finds itself. (One must, of course, make allowances 
for the particular specifications of the transistor itself; a transistor cannot 
function as a battery). 

Nowhere, however, is this feature of polyfunctional modularity more 
intuitively clear than in our use of natural languages, such as English. To 
illustrate this, my colleague Paul Nelson once “disassembled” the last 
forty-four words of Abraham Lincoln’s Gettysburg Address (see Fig. 
18.4) into a lexicon. Using the same words, at roughly the same 
frequency, he then wrote an “Anarchist’s Manifesto,” with a meaning 
diametrically opposite to that of Lincoln’s. What had changed were not 
the words—that is, the lower-level modules. Rather, the higher-level 
context, or the system as a whole, differed. Lincoln wrote with one 
meaning in mind, the anarchist with another—and that made all the 
difference to the functions of the elements, or modules, within the 
respective systems. Nelson calls this context dependency “the 
organismal context principle” when it occurs in biology and he likens it 


to the context dependence of words in language or low-level parts in a 
technological system. 





FIGURE 18.3 

A general purpose transistor—an example of a component that can be 
used to perform different functions in different designed systems. 
Courtesy iStockphoto.com/S23 0. 
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GETTYSBURG ADDRESS 


...that from these honored dead we take 
increased devotion to that cause for 

which they gave the last full measure of 
devotion—that we here highly resolve that 
these dead shall not have died in vain— 
that this nation, under God, shall have a 
new birth of freedom—and that govern- 
ment of the people, by the people, for the 
people, shall not perish from the earth. 
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_AN ANARCHISTS MANIFESTO 


...by this we highly resolve that we shall 
have freedom from this nation—that 
devotion to government shall perish. 
These people honored the last government 
in vain. The dead increased. Measure that 
full devotion! The earth under here gave 
these people birth, not a dead God. And 
for that they shall take their new cause, 

for which people have not died. 


FIGURE 18.4 

This comparison of Lincoln’s Gettysburg Address and an “Anarchist’s 
Manifesto” shows how the same modular elements (words) can perform 
different functions depending on their surrounding context, just as many 
genes do in biological systems. Courtesy Paul Nelson. 


He also argues that intelligent design provides a compelling 
explanation for the presence of polyfunctional modularity in living 
systems. Why? Not only is the polyfunctionality of genetic modules 
unexpected in a neo-Darwinian view, it is a common feature of 
intelligently designed systems. As Nelson and Jonathan Wells note, “An 
intelligent cause may reuse or redeploy the same module in different 
systems, without there necessarily being any material or physical 
connection between those systems.””’ They also observe that intelligent 
agents “can generate identical patterns independently” and put them to 
different uses in different systems of parts: 


If we suppose that an intelligent designer constructed organisms 
using a common set of polyfunctional genetic modules—just as 
human designers, for instance, may employ the same transistor or 
capacitor in a car radio or a computer, . . . then we can explain 
why we find the “same” genes expressed in the development of 
what are very different organisms. . . . A particular gene, 
employed for its DNA-binding properties, finds its functional role 
in a higher-level system whose ultimate origin was intelligently 
caused?’ 


Wells and Nelson go on to explain that “the overall system, not the 
gene itself” determines the ultimate functional significance of the lower 
level modules, just as it does in all human technological or 
communication systems. Certainly, both the computer software and 
computer hardware (integrated circuits) exhibit this feature—what might 
be called “context-dependent, polyfunctional modularity.” Similarly, in 
information-rich texts—such as the Gettysburg Address or the 





Anarchist’s Manifesto—humans convey different meanings with the 
same low-level modules (words) depending upon their surrounding 
context. Experience shows that when we know how systems possessing 
this feature arose, invariably they arose by intelligent design. 


FEATURES OF THE PRE-CAMBRIAN-CAMBRIAN FOSSIL 
RECORD 


Intelligent design not only helps to explain many key features of the 
Cambrian animals themselves; it also helps to explain many otherwise 
anomalous features of the Cambrian fossil record. 


An Inverted Cone: Disparity Preceding Diversity 


As discussed in Chapter 2, the fossil record shows a “top-down” pattern 
in which phyla-level morphological disparity appears first followed only 
later by species-level diversity. Major innovations in body plans precede 
minor variations on basic designs.”? This “inverted cone of diversity” 
also suggests intelligent design. 

Neo-Darwinism seeks to explain the origin of novel body plans by 
starting with simpler body plans and gradually assembling animals with 
more complex body plans via the gradual accumulation of small 
successive material variations. Thus, neo-Darwinism employs a 
“bottom-up” mode of causation. With a bottom-up approach, small-scale 
diversification should eventually produce large-scale morphological 
disparity—differences in body plan. The “bottom-up” metaphor thus 
describes a kind of self-assembly in which the gradual production of the 
material parts eventually generates the organization of the whole. This 
suggests in turn that the parts stand causally prior to the organization of 
the whole. As I have argued, however, this approach encounters both 
paleontological and biological difficulties: the fossil record leaves no 
evidence of such a process and the morphological innovation and 
transformations that it requires are, in any case, biologically implausible. 


But if a bottom-up approach fails, perhaps a “top-down” approach 
will succeed. “Top-down” causation begins with a basic architecture, 
blueprint, or plan and then proceeds to assemble parts in accord with it. 
The blueprint stands causally prior to the assembly and arrangement of 
the parts. But where could such a blueprint come from? One possibility 
involves a mental mode of causation. Intelligent agents often conceive of 
plans prior to their material instantiation—that is, the preconceived 
design of a blueprint often precedes the assembly of parts in accord with 
it. An observer touring the parts section of a General Motors plant will 
see no direct evidence of a prior blueprint for GM’s new models, but 
will perceive the basic design plan immediately upon observing the 
finished product at the end of the assembly line. Designed systems, 
whether automobiles, airplanes, or computers, invariably manifest a 
design plan that preceded their first material instantiation. But the parts 
do not generate the whole. Rather, an idea of the whole directed the 
assembly of the parts. 

This form of causation can certainly explain the pattern in the fossil 
record. As new species appear in the Cambrian, they manifest 
completely novel, morphologically disparate, and functionally integrated 
body plans. Thus, although the fossil record does not directly establish 
the existence of a prior mental plan or blueprint, such a plan could 
certainly explain, or be inferred from, the top-down pattern of fossil 
evidence. In other words, if the body plans of the Cambrian animals did 
arise as the result of a “top-down” mode of causation involving a 
preconceived design plan, we would expect, based on our experience of 
complex designed systems, to find precisely the pattern of evidence that 
we do see in the fossil record. Further, materialistic “bottom-up” models 
of causation fail to explain this same pattern of fossil evidence. Thus, 
intelligent design provides a better explanation of this feature of the 
Cambrian fossil record than do competing materialistic evolutionary 
theories. 

The design hypothesis can also explain why smaller-scale diversity 
arises after, not before, morphological disparity in the fossil record or, to 
put it more poetically, why the basic themes of life precede the variation 


on those themes. Complex designed systems have a fundamental 
functional integrity that makes their alteration difficult. For this reason, 
we should not expect gradual mechanisms of change to produce new 
body plans or alter them fundamentally after they have arisen. We might, 
however, expect to find variations on these basic themes within the 
functional limits established by a basic architecture or body plan. 
Fundamentally new forms of organization require design from scratch. 
For example, airplanes did not arise gradually or incrementally from 
automobiles. Nevertheless, new innovations often accrete to novel 
designs provided the fundamental organizational plan is not altered. 

Since the invention of the automobile, all cars have included the same 
basic structural and functional elements, including a motor, at least three 
(and usually four) wheels, a carriage with seats for passengers, a 
structure connecting the wheels to the carriage, a steering wheel and 
column (or analogous mechanism), and a means of translating energy 
generated by the motor to the wheels. These are minimal requirements, 
of course; many cars have used axles to connect the wheels, though 
some have not and a “stretch” limousine may need additional axles or 
wheels. Indeed, though many new variations on the original model have 
arisen after the invention of the basic automobile design, all exemplify 
this same basic design. Interestingly, we also observe this pattern in the 
fossil record. The major animal body plans appear first instantiated by a 
single (or very few) species or genera. Then, later, many other varieties 
arise with many new features, yet all still exhibit the same basic body 
plan. 

Experience shows a hierarchical relationship between functionally 
necessary and functionally optional features in designed systems. An 
automobile cannot function without a motor or steering mechanism; it 
can function with or without twin I-beam suspension, antilock brakes, or 
“stereo surround sound.” This distinction between functionally 
necessary and optional features suggests the possibility of future 
innovation and variation on basic design plans, even as it imposes limits 
on the extent to which the basic designs themselves can be altered. The 
logic of designed systems, therefore, suggests precisely the kind of top- 


down pattern that we see in both the history of our own technological 
innovation?’ and in the history of life following the Cambrian explosion 
(compare Figs. 18.5a and 18.5b). On the other hand, competing 
materialistic evolutionary theories would not lead us to expect the fossil 
record to manifest such a “top-down” pattern, but the opposite. 


Sudden Appearance and Missing Ancestors 


The theory of intelligent design can also help to account for the abrupt 
appearance of complex anatomical structures and animal body plans in 
the fossil record. Intelligent agents sometimes produce material entities 
through a series of gradual modifications (as when a sculptor shapes a 
sculpture over time). Nevertheless, intelligent agents also have the 
capacity to introduce complex technological systems into the world fully 
formed. Often such systems bear no resemblance to earlier technological 
systems—their invention occurs without a material connection to earlier, 
more rudimentary technologies. When the radio was first invented, it 
was unlike anything that had come before, even other forms of 
communication technology. For this reason, although intelligent agents 
need not generate novel structures abruptly, they can do so. Thus, 
invoking the activity of a mind provides a causally adequate explanation 
for the pattern of abrupt appearance in the Cambrian fossil record. 
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FIGURE 18.5 

Figure 18.5a (top): A top-down pattern of appearance found in the 
history of animal life. Figure 18.5b (bottom): A top-down pattern of 
appearance in human technology. 


On the other hand, strictly materialistic theories of evolution 
necessarily envision a “bottom-up” mode of causation in which material 
parts or materially instantiated intermediate forms of organization 
necessarily precede the emergence of fully developed body plans. For 
this reason, the sudden appearance of novel animal forms contradicts the 
expectations of most materialistic evolutionary theories. Neo- 
Darwinism, in particular, would not expect the sudden appearance of 
animal forms. As Darwin himself insisted: “Natura non facit saltum” 
(“Nature takes no leaps”). Yet intelligent agents can act suddenly or 
discretely in accord with their powers of rational choice or volition, even 
if they do not always do so. Thus, the sudden appearance of the 
Cambrian animals does suggest, at least, the possibility of a volitional 
act of a conscious agent—a designer. 

Intelligent design likewise helps explain the absence of ancestral 
precursors. If body plans arose as the result of an intelligent agent 
actualizing an immaterial plan or idea, then an extensive series of 
material precursors to the first animals need not exist in the fossil record, 
anymore than such a series is always present in the history of 
technology. The radio did not evolve gradually from the telegraph. 
Mental plans or concepts need not leave a material trace. Thus, 
intelligent design can account for the dearth of material precursors in the 
Precambrian strata, whereas “bottom-up” materialistic evolutionary 
theories cannot, especially given the failure of the artifact hypothesis 
discussed earlier. 


Stasis (or Persistent Morphological Isolation) 


Finally, intelligent design also explains the observed stasis in the fossil 
record. As advocates of punctuated equilibrium established, Cambrian 
species tend to persist unchanged in their basic forms over time. Animal 


body plans that define the higher taxa, including classes and phyla, also 
remain especially stable in their basic architectural designs, showing “no 
directional change”?! over geological history after their first appearance 
in the Cambrian. As a result, the morphological disparity between 
distinct animal body plans remains unbridged. Moreover, as noted in 
Chapter 13, developmental mechanisms constrain the degree to which 
organisms may vary without deleterious consequences. 
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FIGURE 18.6 


Comparison of persistent morphological isolation in two animal forms that first arose in the 
Cambrian period, and similar morphological isolation in two technological systems. 


The persistent morphological disparity and isolation of animal body 
plans is completely unexpected by neo-Darwinian, and all other 
gradualist, evolutionary theories, at least. Given such models of 
evolutionary change, theorists should expect the fossil record to exhibit 
forms of life that grade imperceptibly from one to another. Indeed, 
absent either a compelling version of the artifact hypothesis or an 
adequate mechanism of punctuated large-scale evolutionary change, 
“morphospace” should be mostly filled in. The fossil record should not 
display mostly morphologically disparate or separate forms of animal 
life. 

Yet, if living systems did arise as the result of intelligent design, then 
such morphological isolation, and such persistent isolation over time, is 
just what we should expect to see—precisely because that is what we do 
see in the history of other intelligently designed systems (see Fig. 18.6). 
Indeed, experience suggests that designed objects have a functional 
integrity that makes the modification of some of their essential parts and 
their basic organization and architecture difficult or impossible. Though 
the Model-A has been replaced by everything from the Yugo to the 
Honda Accord, the automobile “body plan” with several essential 
functional and/or structural elements has remained unchanged from its 
first appearance in the late nineteenth century. 

Further, despite the design of many innovative variations, 
automobiles have retained their “morphological distance” or structural 
disparity from other functionally distinct technological devices. Indeed, 
persistent morphological disparity in biological systems (manifested as 
stasis in the fossil record) has a direct parallel in our own technology. In 
biology, what we recognize as different organismal body plans are 
systems that differ fundamentally from each other in their overall 
organization. A crab and a starfish, for example, may exhibit some 
similarities in their low-level protein parts, but they differ fundamentally 
in their digestive and nervous systems and in the overall organization of 
their organs and body parts. In the same way, automobiles and airplanes 


may have many similar parts, even as they differ in the composition of 
their distinguishing parts and overall organization. 

The presence of such structural disparities and isolation among 
complex functionally integrated systems represents another distinctive 
feature of the intelligently designed systems known from our own world 
of technology. For example, the basic technology of the CD-ROM (as 
employed in audio systems and computers) did not “evolve” 
incrementally from earlier technologies, such as magnetic media (e.g., 
digital tape or disc storage) or analog systems such as the once standard 
long-playing (LP) record. Indeed, it could not. In an analog recording, 
information is stored as three-dimensional microscopic grooves in a 
vinyl surface and is detected mechanically by a diamond stylus. This 
means of storing and detecting information differs fundamentally, as a 
system, from the digitally encoded pits storing data in the silvered 
surface of a CD-ROM, where information is detected optically, not 
mechanically, by a laser beam. The CD-ROM had to be engineered from 
scratch and, as a result, displays a striking structural difference and 
isolation from other technological devices, even those that perform 
roughly the same function. Although minor new features may “accrete” 
to its basic design architecture, a deep and impassable functional gulf 
separates the CD-ROM as a system from other technological systems. 
As biologist Michael Denton expresses it, “What is true of sentences and 
watches is also true of computer programs, airplane engines, and in fact 
of all known complex systems. Almost invariably, function is restricted 
to unique and fantastically improbable combinations of subsystems, tiny 
islands of meaning lost in an infinite sea of incoherence.”?? In fact, such 
structural disparity or morphological isolation constitutes a diagnostic 
of designed systems—that is, a feature of systems for which only one 
kind of cause—an intelligent cause—is known. 


ACTS OF MIND 


Studies in the history and philosophy of science have shown that to 
explain an event or a set of facts, scientists must typically cite a cause 


capable of producing that event or those facts. When scientists do not 
have the luxury of directly observing the cause of a particular event or 
effect under study, as historical scientists typically do not, they must cite 
a cause that 1s otherwise known to produce the facts in question. That 
means historical scientists must show that the event or facts of interest 
must in some way represent the expected outcome of a particular cause 
having acted in the past—that the event or facts should have occurred 
“as a matter of course.” 

To many scientists, especially those steeped in the materialistic 
assumptions of contemporary scientific culture, the idea of intelligent 
design seems inherently implausible or even incoherent. Science to them 
involves not only observing and studying material entities and 
phenomena, but explaining them by reference to materialistic entities. 
For these scientists, it makes no sense even to consider the idea of 
intelligent design, with its explicit reference to the activity of a 
designing mind. 

Yet it turns out that both the Cambrian animal forms themselves and 
their pattern of appearance in the fossil record exhibit precisely those 
features that we should expect to see if an intelligent cause had acted to 
produce them (see Fig. 18.7). Further, the Cambrian animal forms and 
their manner of appearance contradict what we should expect to find in 
the fossil record and in the animal world given a purely materialistic 
“bottom-up” process of evolution. Thus, despite its potential for 
disturbing the materialistic sensibilities of many scientists, it is hard 
logically to avoid the conclusion that the design hypothesis actually 
provides a better, more casually adequate explanation for key features of 
the Cambrian event. 

When Darwin first acknowledged the problem of the Cambrian fossil 
record, and the small but persistent doubt it raised for him about his 
theory, his nemesis Louis Agassiz not only rejected his theory of 
evolution, but also affirmed an alternative understanding of the nature 
and origin of animal life. To Agassiz, the pattern of animal classification 
and the fossil record reinforced the idea that living forms exemplified 
basic “types’—ideas that had originated in the mind of a designing 





intelligence. Thus, he would argue that the Cambrian fossils tell of “acts 
of mind.”>? 

As noted in Chapter 1, Darwin himself acknowledged both Agassiz’s 
immense paleontological knowledge and the validity of the problems 
that Agassiz raised. Even so, his affirmation of a positive alternative to 
Darwin’s theory in the form a design hypothesis might well have seemed 
premature in the 1860s and certainly did reflect something of the 
prejudice of the times. But more than a century and a half later, after 
many failed attempts to discover—and explain away—the missing fossil 
ancestors, and after discoveries in molecular and developmental biology 
have revolutionized our understanding of the complexity of animal life, 
continuing to regard the Cambrian explosion as merely a niggling 
problem for established theory—a lone question mark or negative clue— 
now seems not so much cautious, as simply unresponsive to the 
evidence. 
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FIGURE 18.7 
Both the Cambrian animal forms and their pattern of appearance in the 
fossil record exhibit distinctive features or hallmarks of designed 
systems—features that we should expect to see if an intelligence acted to 
produce them. 


The animal forms that arose in the Cambrian not only did so without 
any clear material antecedent; they came on the scene complete with 
digital code, dynamically expressed integrated circuitry, and multi- 
layered, hierarchically organized information storage and processing 
systems. 

In light of these marvels and the persistent pattern of the fossil 
record, should we now continue, as Darwin did (who knew nothing of 
them), to regard the Cambrian explosion as just an anomaly? Or may we 
now consider the features of the Cambrian event as evidence supporting 
another view of the origin of animal life? If so, is there now a 
compelling logic for considering a different kind of causal history? 

In fact, there is. The features of the Cambrian event point decisively 
in another direction—not to some as-yet-undiscovered materialistic 
process that merely mimics the powers of a designing mind, but instead 
to an actual intelligent cause. When we encounter objects that manifest 
any of the key features present in the Cambrian animals, or events that 
exhibit the patterns present in the Cambrian fossil record, and we know 
how these features and patterns arose, invariably we find that intelligent 
design played a causal role in their origin. Thus, when we encounter 
these same features in the Cambrian event, we may infer—based upon 
established cause-and-effect relationships and uniformitarian principles 
—that the same kind of cause operated in the history of life. In other 
words, intelligent design constitutes the best, most causally adequate 
explanation for the origin of information and circuitry necessary to build 
the Cambrian animals. It also provides the best explanation for the top- 
down, explosive, and discontinuous pattern of appearance of the 
Cambrian animals in the fossil record. 


19 


The Rules of Science 


The argument of the previous chapter raises an obvious question. If 
intelligent design provides such a clear and satisfying resolution to the 
mystery of the Cambrian explosion, why have so many brilliant 
scientists missed it? 

While reflecting on this question, I came across a short story by G. K. 
Chesterton called “The Invisible Man,” which may cast some light on it. 
In “The Invisible Man,” Chesterton tells the story of someone who is 
murdered in an apartment with only one entrance, an entrance watched 
by four honest men. These men insist that during their watch no one 
entered or left the building. A brilliant French detective investigates the 
case, along with his friend, a dusty little Catholic priest. They query the 
guards, each of whom insists that no one entered or exited the building. 
But then the unimpressive looking priest, Father Brown, all but forgotten 
in the background, pipes up to ask, “Has nobody been up and down 
stairs, then, since the snow began to fall?” 

“Certainly not,” they assure him. 

“Then I wonder what that is?” Father Brown asks, gazing at the white 
snow on the outside entrance stairs. Everyone turns to find a “stringy 
pattern of grey footprints” there. 

“God!” one of them cries, “An invisible man!” 

After asking a few more questions, Father Brown quickly unravels 
the mystery. “When those four quite honest men said that no man had 
gone into the Mansions, they did not really mean that no man had gone 
into them,” Father Brown explains to his detective friend. “They meant 


no man whom they could suspect of being your man. A man did go into 
the house and did come out of it, but they never noticed him.” 

“An invisible man?” 

“A mentally invisible man,” the priest explains. 

What does a mentally invisible man look like? 

“He is dressed rather handsomely in red, blue and gold,” the priest 
explains, “and in this striking, and even showy, costume he entered 
Himylaya Mansions [the name of the apartment complex] under eight 
human eyes; he killed . . . [the murder victim] in cold blood, and came 
down into the street again carrying the dead body. .. . You have not 
noticed such a man as this.” 

At that moment, he reaches out and puts his hand on “an ordinary 
passing postman,” one who had almost slipped by them unnoticed. 

“Nobody ever notices postmen somehow,” Father Brown muses. “Yet 
they have passions like other men, and even carry large bags where a 
small corpse can be stowed quite easily.”! 

The passing postman, of course, is the murderer. He walked up and 
down the stairs under the four men’s noses, but because of their mental 
blinders telling them whom to consider and whom to ignore, they 
overlooked the postman entirely. 

The theme is a favorite of detective-story authors: the obvious 
possibility missed by the experts, because their assumptions prevent 
them from considering what might otherwise seem to be an obvious 
possibility. Could something like that be at work in the investigation of 
the (Cambrian explosion? Could evolutionary biologists and 
paleontologists be wearing a set of mental blinders that keeps them from 
considering a possible explanation of the Cambrian mystery? 

Odd as it may seem, that is exactly what has been going on in the 
investigation of the Cambrian explosion. In this case, however, those 
wearing the mental blinders have elevated an unwillingness to consider 
certain explanations to a principle of scientific method. That principle is 
called “methodological naturalism” or “methodological materialism.” 
Methodological naturalism asserts that to qualify as scientific, a theory 
must explain phenomena and events in nature—even events such as the 


origin of the universe and life or phenomena such as human 
consciousness—by reference to strictly material causes. According to 
this principle, scientists may not invoke the activity of a mind or, as one 
philosopher of science puts it, any “creative intelligence.” 

To see how adherence to this principle has prevented scientists from 
considering a possibly true (even “causally adequate”) explanation for 
the Cambrian explosion, let’s revisit the case reported in Chapter 11 of 
Richard Sternberg (see Fig. 19.1), the evolutionary biologist at the 
Smithsonian’s National Museum of Natural History. After Sternberg 
published my article arguing for intelligent design as the best 
explanation of the Cambrian information explosion in the technical 
journal Proceedings of the Biological Society of Washington,’ he 
suffered professional retribution at the hands of Smithsonian 
administrators.4 The Biological Society of Washington, the governing 
body that oversees the publication of the journal that Sternberg then 
edited, also issued a public statement repudiating his decision.’ Its 
statement did not, however, cite any factual errors in the article or seek 
to rebut it. Further, the president of the society, Smithsonian zoologist 
Roy McDiarmid, wrote Sternberg privately and told him that he 
(McDiarmid) had reviewed the file containing the peer-review reports 
and had found everything to be in order.® 

What, then, had Sternberg done to deserve public rebuke? 

Sternberg published a paper that violated a presumed rule of science: 
methodological naturalism. Without saying it in so many words, the 
Biological Society made crystal clear that this was the crucial issue. 
When it distanced itself from Sternberg and the review essay, it did not 
invite a scientific refutation of the article, as if the problem had been a 
misrepresentation or misinterpretation of the evidence. Instead, it 
attempted to settle the issue by releasing a policy statement. As a writer 
in the Wall Street Journal reported at the time, “The Biological Society 
of Washington released a vaguely ecclesiastical statement regretting its 
association with the article. It did not address its arguments but denied 
its orthodoxy, citing a resolution of the American Association for the 


Advancement of Science that defined ID [the theory of intelligent 
design] as, by its very nature, unscientific.” 





FIGURE 19.1 
Richard Sternberg. Courtesy Laszlo Bencze. 


The Biological Society of Washington “deemed the paper 
inappropriate for the pages of the Proceedings.” The Society attempted 
to justify this claim, first, on thin procedural grounds, claiming that a 
paper about the origin of animal body plans represented a “departure” 
from its more typical concern with issues of animal classification. 
Second, and more tellingly, it cited the policy statement of the American 
Association for the Advancement of Science (AAAS) “calling upon its 
members to understand the nature of science” and to recognize “the 
inappropriateness of ‘intelligent design theory’ as subject matter for 
science education.” Setting aside the obvious point that my paper was 
written not as a curricular manifesto but as an evidence-based scientific 
argument, the AAAS statement affirmed an implicitly and strictly 
materialistic understanding of the nature of science. It did so to 
disqualify intelligent design from consideration—not only in science 
education, but in science itself. 

The Sternberg case—like numerous others in which the academic 
freedom of scientists advocating intelligent design has been abridged!” 
—goes a long way to answering the question of why so many otherwise 
brilliant and knowledgeable scientists have overlooked such a seemingly 
obvious possible answer to the Cambrian conundrum. As in Chesterton’s 
story about the invisible postman, they have accepted a self-imposed 
limitation on the hypotheses they are willing to consider. These scientists 
think they are doing their duty to science. Yet if researchers refuse as a 
matter of principle to consider the design hypothesis, they will obviously 
miss any evidence that happens to support it. And the cultural pressure 
within biology to avoid considering the intelligent design hypothesis has 
long been nontrivial. Francis Crick, for example, famously admonished 
biologists to “constantly keep in mind that what they see was not 
designed, but rather evolved.”!! In 1997, in an article in the New York 
Review of Books, Harvard geneticist Richard Lewontin made explicit a 
similar commitment to a strictly materialistic explanation—whatever the 


evidence might seem to indicate. As he explained in a now often quoted 
passage: 


We take the side of science in spite of the patent absurdity of some 
of its constructs, in spite of its failure to fulfill many of its 
extravagant promises of health and life, in spite of the tolerance of 
the scientific community for unsubstantiated just-so stories, 
because we have a prior commitment, a commitment to 
materialism. It is not that the methods and institutions of science 
somehow compel us to accept a material explanation of the 
phenomenal world, but, on the contrary, that we are forced by our 
a priori adherence to material causes to create an apparatus of 
investigation and a set of concepts that produce material 
explanations, no matter how counter-intuitive, no matter how 
mystifying to the uninitiated. Moreover, that materialism is 


absolute, for we cannot allow a Divine Foot in the door. 12 


The commitment to methodological naturalism that Lewontin 
describes, as well as the behavior of scientists in cases such as 
Sternberg’s, leave no doubt that many in science simply will not 
consider the design hypothesis as an explanation for the Cambrian 
explosion or any other event in the history of life, whatever the 
evidence. To do so would be to violate the “rules of science” as they 
understand them. 


BUT IS IT SCIENCE? 


But are these scientists right? Perhaps science must limit itself to purely 
naturalistic or materialistic explanations. If so, are there perhaps good 
reasons for excluding the design hypothesis from consideration as a 
scientific hypothesis? Is methodological naturalism the correct policy for 
science? 

Though scientists routinely assert methodological naturalism as a 
scientific norm, that principle and its exclusion of the design hypothesis 


have proven difficult to justify. To claim that a specific theory does not 
qualify as scientific requires a definition of science or a set of 
definitional criteria by which to make that kind of a judgment. Some 
philosophers and scientists have asserted that for a scientific theory to 
qualify as scientific, it must meet various criteria of testability, 
falsifiability, observability, repeatability, and the like. Philosophers of 
science call these “demarcation criteria,” because some scientists purport 
to use them to define or “demarcate” science and to distinguish it from 
pseudo science or from other forms of inquiry such as history, religion, 
or metaphysics. !? 


THE GENERAL PROBLEM OF DEMARCATION 


The demarcation question has long been a vexing one. Historically, 
scientists and philosophers of science have thought that science could be 
distinguished by its especially rigorous method of study. But attempts to 
define science by reference to a distinctive method have proven 
problematic because different branches and types of science use 
different methods. 

For example, some scientific disciplines distinguish and classify 
natural entities, while others attempt to formulate overarching laws that 
apply to all entities. Some disciplines perform laboratory experiments 
under controlled and replicable conditions, while others attempt to 
reconstruct or explain singular events in the past, often based on field 
studies of evidence or clues rather than laboratory experiments. Some 
disciplines generate mathematical descriptions of natural phenomena 
without positing mechanisms to explain them. Others look for 
mechanisms or explain law-like regularities by reference to underlying 
mechanisms. Some scientific disciplines make predictions to test 
theories, while others test competing theories by comparing their 
explanatory power. Some disciplines use both these methods, while 
some conjectures (particularly in theoretical physics) may not be testable 
at all. And on it goes. 


An episode in the history of science illustrates the problem. During 
the seventeenth century, a group of scientists called the “mechanical 
philosophers” insisted, based largely on advances in early chemistry, that 
scientific theories must provide mechanistic explanations. Such 
explanations had to involve one material entity pushing or pulling 
another. Yet in physics, Isaac Newton (1642-1727) formulated an 
important theory that provided no mechanistic explanation. His theory of 
universal gravitation described mathematically, but did not explain in a 
mechanistic way, the gravitational attraction between planetary bodies— 
bodies separated from each other by miles of empty space with no 
means of mechanical interaction with each other whatsoever.!* Despite 
provocation from the German mathematician Gottfried Wilhelm Leibniz 
(1646-1716), who defended the mechanistic ideal, Newton expressly 
refused to give any explanation, mechanistic or otherwise, for the 
mysterious “action at a distance” that his theory described. !> 

Did that make Newton’s theory unscientific? Strictly speaking, the 
answer depends upon which definition of science someone chooses to 
apply. Today one would be hard-pressed to find anybody who denies that 
Newton’s famous theory qualified as scientific. Yet we could easily find 
scientists still willing to say scientific theories must provide mechanisms 
as well as others who would deny as much. 

And that is the problem. If scientists and philosophers of science do 
not have an agreed-upon definition of science, how can they settle 
questions about which theories do and do not qualify as scientific? If 
scientists lack such a definition, it’s difficult to argue that any particular 
theory is unscientific by definition. For this reason, philosophers of 
science, the scholars who study the nature and definition of science, now 
almost universally reject the use of demarcation arguments to decide the 
validity of theories or settle competition between them.!° They 
increasingly regard demarcation as an essentially semantic question and 
nothing more. Is theory X scientific or not? Answer: that depends upon 
which definition of science is used to decide the question. 

Moreover, as the philosopher of science Larry Laudan has shown in a 
seminal article, “The Demise of the Demarcation Problem,” attempts to 


apply demarcation criteria to decide the scientific status of specific 
theories have invariably generated irreconcilable contradictions.!’ The 
vortex theory of gravity that Newton’s theory replaced envisioned 
planets swirling around the sun pushed by a substance called ether.!® It 
did provide a mechanistic explanation for gravitational attraction. It 
failed, however, to explain the evidence and was judged by Newton and 
physicists following him to be manifestly false. Nevertheless, because it 
proposed a mechanistic cause of gravitation, it qualified as 
“scientific” —at least given the conception of science favored by Leibniz 
and the mechanical philosophers.'? Conversely, Newton’s theory failed 
to as scientific by their definition, though it much more accurately fit the 
evidence. 

Such contradictions have long beset the whole enterprise of 
demarcation. Theories that scientists have rejected as false because of 
their inability to explain or describe the evidence often meet the very 
criteria or methodological features (testability, falsifiability, 
repeatability, observability, etc.) that allegedly characterize true science. 
On the other hand, many highly esteemed or successful theories often 
lack allegedly necessary features of genuine science. 

Thus, philosophers of science generally think it much more important 
to assess whether a theory is true, or whether the evidence supports it, 
than whether it should or should not be classified as “science.” The 
question of whether a theory is “scientific” is really a red herring. What 
we really want to know is whether a theory is true or false, supported by 
the evidence or not, worthy of our belief or not. And we cannot decide 
those questions by applying a set of abstract criteria that purport to tell in 
advance what all good scientific theories must look like.?° 


DEFINE AND DISMISS: DEMARCATION ARGUMENTS 
AGAINST INTELLIGENT DESIGN 


The rejection of demarcation arguments among philosophers of science 
has not stopped critics of intelligent design from attempting to settle 
debates about biological origins by the expedient of formulating such 


arguments against intelligent design. Some use these arguments to 
justify methodological naturalism (which has the same effect). 

Advocates of methodological naturalism have argued that the theory 
of intelligent design is inherently unscientific for some, or all, of the 
following reasons: (a) is not testable,*! (b) is not falsifiable,” (c) does 
not make predictions,” (d) does not describe repeatable phenomena, (e) 
does not explain by reference to natural law,” (f) does not cite a 
mechanism,~> (g) does not make tentative claims,’ and (h) has no 
problem-solving capability.” They have also claimed that it is not 
science because it (i) refers to an unobservable entity.7* These critics 
also assume, imply, or assert that materialistic evolutionary theories do 
meet such criteria of proper scientific method. 

Readers may wish to consult Signature in the Cell for a more detailed 
response to these specific arguments. There I show that many of these 
claims are simply false (e.g., contrary to the claims of its critics 
intelligent design is testable; it does make predictions; it does formulate 
its claims tentatively; and it does have scientific problem-solving 
capability). But I also show that when the claims of those making 
demarcation arguments are true—when intelligent design doesn’t meet a 
specific criterion—that fact does not provide good reason for excluding 
intelligent design from consideration as a scientific theory. Why? 
Because the materialistic evolutionary theories that intelligent design 
challenges, theories widely regarded by convention as “scientific,” fail 
to meet the very same demarcation standard. In other words, there is no 
defensible definition of science, and no specific demarcation criterion, 
that justifies both excluding intelligent design from science and 
including competing materialistic evolutionary theories. Instead, 
attempts to use demarcation criteria specifically to disqualify intelligent 
design as a scientific theory have repeatedly failed to differentiate the 
scientific status of intelligent design from that of competing theories. 
Depending upon which criteria are used to adjudicate their scientific 
status, and provided metaphysically neutral criteria are selected to make 
such assessments, intelligent design and materialistic origins theories 
invariably prove equally scientific or unscientific. 


For example, some critics of intelligent design have argued that it 
fails to qualify as a scientific theory because it makes reference to an 
unseen or unobservable entity, namely, a designing mind in the remote 
past. Yet many accepted theories—theories assumed to be scientific— 
postulate unobservable events and entities. Physicists postulate forces, 
fields, and quarks; biochemists infer submicroscopic structures; 
psychologists discuss their patients’ mental states. Evolutionary 
biologists themselves infer unobserved past mutations and invoke the 
existence of extinct organisms and transitional forms for which no 
fossils remain. Such things, like the actions of an intelligent designer, are 
inferred from observable evidence in the present, because of the 
explanatory power they may offer. 

If the demarcation criterion of observability is applied rigidly, then 
both intelligent design and materialistic theories of evolution fail to 
qualify as scientific. If the standard is applied more liberally (or 
realistically)—acknowledging the way in which historical scientific 
theories often infer unobservable past events, causes, or entities—then 
both theories qualify as scientific. 

And so it goes with other such criteria as well. There is no specific 
(non-question-begging) demarcation criterion that succeeds in 
disqualifying the theory of intelligent design from consideration as a 
scientific theory without also doing the same to its materialistic rivals. 


REASONS TO REGARD INTELLIGENT DESIGN AS A 
SCIENTIFIC THEORY 


Demarcation arguments fail to justify excluding intelligent design from 
science. But it turns out that there are some good—if convention- 
dependent—reasons to regard intelligent design as a scientific theory. 
For example, many scientists and philosophers of science regard 
testability as an important feature of scientific inquiry. And intelligent 
design is testable in three specific and interrelated ways. First, like other 
scientific theories concerned with explaining events in the remote past, 
intelligent design is testable by comparing its explanatory power with 


that of competing theories. Second, intelligent design, like other 
historical scientific theories, is tested against our knowledge of the 
cause-and-effect structure of the world. As we have discussed, historical 
scientific theories provide adequate explanations when they cite causes 
that are known to produce the effects in question or “causes now in 
operation.””? Because of this, the plausibility of historical scientific 
theories, including intelligent design, can be tested by reference to 
independent knowledge of cause-and-effect relationships. Third, 
although historical scientific theories typically cannot be tested under 
controlled laboratory conditions, they do sometimes generate predictions 
that enable scientists to compare their merit to that of other theories. 
Intelligent design has generated a number of specific empirical 
predictions that distinguish it from competing evolutionary theories and 
that serve to confirm the design hypothesis over its competitors. (In 
Signature in the Cell, | described ten such predictions that the theory of 
intelligent design has generated).°” 

There is another compelling, if convention-dependent, reason to 
regard intelligent design as a scientific theory. The inference to 
intelligent design is based upon the same method of historical scientific 
reasoning and the same uniformitarian principles that Charles Darwin 
used in On the Origin of Species. The similarity in logical structure runs 
quite deep. Both the argument for intelligent design and the Darwinian 
argument for descent with modification were formulated as abductive 
inferences to the best explanation. Both theories address 
characteristically historical questions; both employ typically historical 
forms of explanation and testing; and both have metaphysical 
implications. Insofar as we regard Darwin’s theory as a scientific theory, 
it seems appropriate to designate the theory of intelligent design as a 
scientific theory as well. 

Indeed, neo-Darwinism and the theory of intelligent design are not 
two different kinds of inquiry, as some critics have asserted. They are 
two different answers—formulated using a similar logic and method of 
reasoning—to the same question: “What caused biological forms and the 
appearance of design to arise in the history of life?” It stands to reason 


that if we regard one theory, neo-Darwinism or intelligent design, as 
scientific, we should regard the other as the same. Of course, whether 
either theory is true or not is another matter. An idea may be scientific 
and incorrect. In the history of science, many theories have proven to be 
so. The vortex theory of gravity, to which I referred earlier, would be 
one of nearly countless illustrations. 

For readers who would like to consider more detailed responses to 
arguments about whether intelligent design qualifies as “science,” I 
recommend Chapters 18 and 19 in Signature in the Cell?! In Signature, 
I respond in detail to other philosophical objections to the case for 
intelligent design. These include challenges such as: (a) intelligent 
design is religion, not science,* (b) the case for intelligent design is 
based on flawed analogical reasoning, (c) intelligent design is a 
fallacious argument from ignorance, sometimes called the “God of the 
Gaps” objection, (d) intelligent design is a science stopper, (e) the 
famous zinger, popularized by Richard Dawkins, that asks “Who 
designed the designer??? and many others. 


A NEW OBJECTION TO THE SCIENTIFIC STATUS OF 
INTELLIGENT DESIGN 


Since the publication of Signature in the Cell, Robert Asher, a 
University of Cambridge paleontologist, has offered another reason to 
contest my characterization of intelligent design as a scientific theory. In 
his book, Evolution and Belief, he challenges my claim to have used the 
uniformitarian method of Lyell and Darwin to develop the case for 
intelligent design. Since his objection is new, published only in 2012 by 
Cambridge University Press, it deserves discussion. 

Asher characterizes my thinking as follows: “The processes we know 
and observe today are relevant to explaining the phenomena of the past, 
and we know that particularly complicated things we see today have an 
intelligence behind them.’** He notes that I argue certain complex 
technologies, such as computer software, have “only one source: human 
ingenuity.”*> It follows, according to Asher’s paraphrase of my 


argument that “a similarly complex device we observe in the geological 
past must also have arisen as a result of something like human ingenuity, 
i.e., intelligence.”*° 

Asher doesn’t seem to understand the importance of specified 
information, as opposed to “complicated things,” as a key indicator of 
design. That aside, he does claim to recognize the role of uniformitarian 
principles of reasoning in my argument for intelligent design. In spite of 
this, Asher elsewhere disputes that I employ the uniformitarian method 
of reasoning. Why? According to Asher, the inference to intelligent 
design is actually “anti-uniformitarian” because it doesn’t provide a 
“mechanism.” As he puts it, “by attempting to replace a causal 
mechanism (natural selection) with an attribution of agency (design), ID 
advocates such as Meyer are decidedly anti-uniformitarian. What 
process of today could possibly lead to his understanding of the past?”>” 

The answer to Asher’s question seems pretty obvious. The answer is: 
intelligence. Conscious activity. The deliberate choice of a rational 
agent. Indeed, we have abundant experience in the present of intelligent 
agents generating specified information. Our experience of the causal 
powers of intelligent agents—of “conscious activity” as “a cause now in 
operation”—provides a basis for making inferences about the best 
explanation of the origin of biological information in the past. In other 
words, our experience of the cause-and-effect structure of the world— 
specifically the cause known to produce large amounts of specified 
information in the present—provides a basis for understanding what 
likely caused large increases in specified information in living systems 
in the past. It is precisely my reliance on such experience that makes 
possible an understanding of the type of causes at work in the history of 
life. It also makes my argument decidedly uniformitarian—not “anti- 
uniformitarian’—1in character. 

Asher confuses the uniformitarian imperative in historical scientific 
explanations (the need to cite a presently known or adequate cause) with 
a demand for citing a material cause, or mechanism. The theory of 
intelligent design does cite a cause, and indeed one known to produce 
the effects in question, but it does not necessarily cite a mechanistic or 








materialistic cause. Proponents of intelligent design may conceive of 
intelligence as a strictly materialistic phenomenon, something reducible 
to the neurochemistry of a brain, but they may also conceive of it as part 
of a mental reality that is irreducible to brain chemistry or any other 
physical process. They may also understand and define intelligence by 
reference to their own introspective experience of rational consciousness 
and take no particular position on the mind-brain question. 

Asher assumes that intelligent design denies a materialistic or 
“physicalist” account of the mind (as I personally do, in fact) and rejects 
it as unscientific on that basis. But he offers no noncircular reason for 
making that judgment. He cannot say that the principle of 
methodological naturalism requires that all genuinely scientific theories 
invoke only mechanistic causes, because the principle of methodological 
naturalism itself needs justification. And asserting that “all genuinely 
scientific theories must provide mechanisms” is just to restate the 
principle of methodological naturalism in different words. Indeed, to say 
that all scientific explanations must provide a mechanism is equivalent 
to saying that they must cite materialistic causes—precisely what the 
principle of methodological naturalism asserts. Asher seems to be 
assuming without justification that all scientifically acceptable causes 
are mechanistic or materialistic. His argument thus assumes a key point 
at issue, which is whether there are independent—that is, metaphysically 
neutral—treasons for requiring historical scientific theories to cite 
materialistic causes in their explanations as opposed to explanations that 
invoke possibly immaterial entities such as creative intelligence, mind, 
mental action, agency, or intelligent design. 

In any case, he confuses the logical requirement of citing a vera 
causa, a true or known cause, with an arbitrary requirement to cite only 
materialistic causes. He confuses  uniformitarianism with 
methodological naturalism.** He then critiques my design argument for 
rejecting the former, though it only rejects the latter. In so doing, he 
imposes an additional requirement on explanations of past events that 
leads him to mistake my argument as anti-uniformitarian and to miss the 
evidence for intelligent design. His implicit commitment to 


methodological naturalism makes the evidence for intelligent design 
—‘the postman,” as it were—mentally invisible to him. 

Nevertheless, the concern that he raises about the theory of intelligent 
design not citing a mechanism still troubles people. In fact, I frequently 
get questions about this issue. People will ask something like this: “I can 
see your point about digital code providing evidence for intelligent 
design, but how exactly did the designing intelligence generate that 
information or arrange matter to form cells or animals?” Or: “How did 
the intelligent designer that you infer impress its ideas on matter to form 
animals?” As Asher puts it, “How could a biological phenomenon, even 
if designed, be simply willed into existence without an actual 
mechanism??? 

To help clear things up, several points need to be considered. First, 
the theory of intelligent design does not provide a mechanistic account 
of the origin of biological information or form, nor does it attempt to. 
Instead, it offers an alternative causal explanation involving a mental, 
rather than a necessarily or exclusively material, cause for the origin of 
that reality. It attributes the origin of information in living organisms to 
thought, to the rational activity of a mind, not a strictly material process 
or mechanism. That does not make it deficient as a materialistic or 
mechanistic explanation. It makes it an alternative to that kind of 
explanation. Advocates of intelligent design do not propose intelligent 
causes because they cannot think of a possible mechanistic explanation 
for the origin of form or information. They propose intelligent design 
because they think it provides a better, more causally adequate 
explanation for these realities. Given what we know from experience 
about the origin of information, materialistic explanations are the 
deficient ones. 

There is a different context in which someone might want to ask 
about a mechanism. He or she may wish to know by what means the 
information, once originated, is transmitted to the world of matter. In our 
experience, intelligent agents, after generating information, often use 
material means to transmit that information. A teacher may write on a 
chalkboard with a piece of chalk or an ancient scribe may have chiseled 


an inscription in a piece of rock with a metal implement. Often, those 
who want to know about the mechanism of intelligent design are not 
necessarily challenging the idea that information ultimately originates in 
thought. They want to know how, or by what material means, the 
intelligent agent responsible for the information in living systems 
transmitted that information to a material entity such as a strand of 
DNA. To use a term from philosophy, they want to know about “the 
efficient cause” at work. 

The answer is: We simply don’t know. We don’t have enough 
evidence or information about what happened, in the Cambrian 
explosion or other events in the history of life, to answer questions about 
what exactly happened, even though we can establish from the clues left 
behind that an intelligent designer played a causal role in the origin of 
living forms. 

An illustration from archaeology helps explain how this can be so 
(see Fig. 19.2). Years ago explorers of a remote island in the 
southwestern Pacific Ocean discovered a group of enormous stone 
figures. The figures displayed the distinctive shape of human faces. 
These figures left no doubt as to their ultimate origin in thought. 
Nevertheless, archeologists still don’t know the exact means by which 
they were carved or erected. The ancient head carvers might have used 
metallic hammers, rock chisels, or lasers for that matter. Though 
archaeologists lack the evidence to decide between various hypotheses 
about how the figures were constructed, they can still definitely infer 
that intelligent agents made them. In the same way, we can infer that an 
intelligence played a causal role in the origin of the Cambrian animals, 
even if we cannot decide what material means, if any, the designing 
intelligence used to transmit the information, or shape matter, or impart 
its design ideas to living form. Although the theory of intelligent design 
infers that an intelligent cause played a role in shaping life’s history, it 
does not say how the intelligent cause affected matter. Nor does it have 
to do so. 





Group of carvings of giant heads, called “Moais,” on Easter Island. 
Courtesy iStockphoto/Think-stock. 


There is a logical reason we cannot without further information 
determine the mechanism or means by which the intelligent agent 
responsible for life transmitted its design to matter. We can infer an 
intelligent cause from certain features of the physical world, because 
intelligence is known to be a necessary cause, the only known cause, of 
those features. That allows us to infer intelligence retrospectively as a 
cause by observing its distinctive effects. Nevertheless, we cannot 
establish a unique scenario describing how the intelligent agent 
responsible for life arranged or impressed its ideas on matter, because 
there are many different possible means by which an idea in the mind of 
an intelligent agent could be transmitted or instantiated in the physical 
world. 

There is another even more profound reason that intelligent design— 
indeed, science itself—may not be able to offer a completely 
mechanistic account of the instantiation of thought into matter. Robert 
Asher worries about how “a biological phenomenon, even if designed,” 
could be “simply willed into existence without an actual mechanism.” In 


Asher’s understanding, the uniformitarian principle asks for a precedent, 
a known cause that not only generates information, but translates 
immaterial thought into material reality, impressing itself on and shaping 
the physical world. Asher complains that the argument for intelligent 
design cannot cite such a precedent and is thus “anti-uniformitarian.” 

Yet a precedent comes very readily to mind, an intimately familiar 
one for us all. At present no one has any idea how our thoughts—the 
decisions and choices that occur in our conscious minds—affect our 
material brains, nerves, and muscles, going on to instantiate our will in 
the material world of objects. However, we know that is exactly what 
our thoughts do. We have no mechanistic explanation for the mystery of 
consciousness, nor what is called the “mind-body problem”—the enigma 
of how thought affects the material state of our brains, bodies, and the 
world that we affect with them. Yet there is no doubt that we can—as the 
result of events in our conscious minds called decisions or choices 
—‘“will into existence” information-rich arrangements of matter or 
otherwise affect material states in the world. Professor Asher did this 
when he wrote the chapter in his book—representing his ideas impressed 
as words onto a material object, a printed page—attempting to refute 
intelligent design. I am doing this right now. This example, 
representative of countless daily experiences in life, surely satisfies the 
demands of uniformitarianism. 

Though neuroscience can give no mechanistic explanation for 
consciousness or the mind-body problem, we also know that we can 
recognize the product of thought, the effect of intelligent design, in its 
distinctive information-rich manifestations. Professor Asher recognized 
evidence of thought when he read the text in my book; I did so when I 
read his; you are doing so right now. Thus, even though it remains 
entirely possible that we may never know how minds affect matter and, 
therefore, that there may always be a gap in our attempt to account for 
how a designing mind affected the material out of which living systems 
were formed, it does not follow that we cannot recognize evidence of the 
activity of mind in living systems. 





WHY IT MATTERS FOR SCIENCE 


But if proponents of intelligent design admit that they do not, or perhaps 
even cannot, answer the question of how the mind responsible for the 
design of animal life impressed its ideas on matter, why does it matter 
that we recognize the evidence for intelligent design at all? If intelligent 
design just replaces one mystery with another, why not limit ourselves to 
materialistic explanations after all, as methodological naturalism 
requires, and be content with accepting the mystery we already have? 
Wouldn’t that be simpler and more intellectually economical? 

Perhaps. But it puts the mystery in the wrong place. We do know of a 
cause that can produce the functional information necessary to build 
complex systems. But we do not know exactly how mind interfaces with 
matter. If we were to ask what caused the Rosetta Stone to arise, and 
then insist despite all evidence to the contrary that a purely material 
process is capable of producing the information-rich etchings on that 
stone, we would be deluding ourselves. The information etched into that 
black slab of igneous rock at the British Museum provides 
overwhelming evidence that an intelligent agent did cause those 
inscriptions. Any rule that prevents us from considering such an 
explanation diminishes the rationality of science, because it prevents 
scientists from considering a possibly—and in this case obviously—true 
explanation. And the truth matters, not least in science. For this reason, 
the “rules of science” should not commit us to rejecting possibly true 
theories before we even consider the evidence. But that is exactly what 
methodological naturalism does. 

Moreover, adhering to methodological naturalism and refusing to 
consider the evidence for intelligent design in life does not just affect the 
explanations that we are willing to consider for the origins and history of 
life. They also affect the questions we ask about life as it exists, and thus 
the entire biological research agenda that we pursue. 

An analogy to a human artifact again shows why. If we ask exactly 
how the scribe responsible for the inscriptions on the Rosetta Stone 
accomplished his or her task—with a metal chisel, a sharpened piece of 
obsidian, a diamond stylus, or some other material means—we may not 


have enough evidence to answer that question. Nevertheless, it will help 
archaeologists to know that they are looking at an artifact of intelligence, 
rather than a byproduct of strictly natural processes. This will lead them 
to ask other more relevant questions about the stone, such as: “What do 
the inscriptions mean?” “Who wrote them?” and “What do they tell us 
about the surrounding cultures at the time?” In a similar way, what we 
think about how animal life arose and developed will lead us to ask 
different questions about living forms—dquestions that we might never 
think to ask if we were assuming that they had arisen by a purely 
undirected mechanism such as natural selection. 

Intelligent agents and natural selection do their work very differently. 
The mutation and selection mechanism is a blind, trial-and-error process, 
one that must maintain or optimize functional advantage through a series 
of incremental steps. Given Darwinian assumptions, we would not 
expect to see structures or systems in living organisms that required 
foresight. Nor would we expect to see structures that needed to be 
produced all at once in large jumps rather than by a series of function- 
preserving incremental steps. We would, however, expect to see 
evidence of a trial-and-error process in the genomes of organisms. 

But what happens if we open ourselves to the possibility of detecting 
design in life? We know a lot about how intelligent designers do their 
work. Intelligent designers use many established design strategies (or 
“design patterns,” as engineers would say). They also have foresight that 
allows them to reach functional goals without the need to maintain 
function through a series of intermediate structures. They typically 
engineer new systems from scratch without relying on random, 
incremental, trial-and-error modifications in one system to produce 
another. 

Because these two different types of causes operate differently and 
often produce different types of structures and systems, scientists should 
expect living systems (and the history of life) to look differently 
depending upon which type of cause produced the organisms or 
structures in question. And these differing perspectives and expectations 


can lead scientists to ask different research questions and make different 
predictions about what we should find in the structure of life itself. 


THE ENCODE PROJECT AND AN ID PREDICTION 


In 2012, a dramatic confirmation of one such prediction made by 
advocates of intelligent design occurred in the field of genomics. Three 
leading science journals, Nature, Genome Research, and Genome 
Biology, published a series of groundbreaking papers reporting on the 
results of a massive study of the human genome called the ENCODE 
project (short for Encyclopedia of DNA Elements).*? The conclusion: at 
least 80 percent of the genome performs significant biological functions, 
“dispatching the widely held view that the human genome is mostly 
‘junk DNA.’ 4! 

The discovery challenged a long held neo-Darwinian interpretation of 
the genome. According to neo-Darwinism, the genome as a whole 
should display evidence of the random trial-and-error process that gave 
rise to new genetic information. The discovery in the 1970s that only a 
small percentage of the genome contains information for building 
proteins was hailed at the time as powerful confirmation of the 
Darwinian view of life. The noncoding regions of the genome were 
assumed to be nonfunctional detritus of the trial-and-error mutational 
process—the same process that produced the functional code in the 
genome. As a result, these noncoding regions were deemed “junk 
DNA,” including by no less a scientific luminary than Francis Crick.*” 

Because intelligent design asserts that an intelligent cause produced 
the genome, design advocates have long predicted that most of the 
nonprotein-coding sequences in the genome should perform some 
biological function, even if they do not direct protein synthesis. Design 
theorists do not deny that mutational processes might have degraded 
some previously functional DNA, but we have predicted that the 
functional DNA (the signal) should dwarf the nonfunctional DNA (the 
noise), and not the reverse. As William Dembski, a leading design 
proponent, predicted in 1998, “On an evolutionary view we expect a lot 


of useless DNA. If, on the other hand, organisms are designed, we 
expect DNA, as much as possible, to exhibit function.”*? 

The ENCODE project and other recent research in genomics have 
confirmed this prediction. As the lead article in Nature reported, 
ENCODE has “enabled us to assign biochemical functions for 80 
percent of the genome in particular outside of the well-studied protein- 
coding regions.”** Other research in genomics has shown that, overall, 
the noncoding regions of the genome function much like the operating 
system in a computer. Indeed, the noncoding regions of the genome 
direct the timing and regulate the expression of the data modules or 
coding regions of the genome, in addition to possessing myriad other 
functions.*° Before ENCODE, neo-Darwinists would often ask: If the 
information in DNA provides such compelling evidence for the activity 
of a designing intelligence, why is over 90 percent of the genome 
composed of functionless nonsense sequences? The latest genomics 
research now provides a ready answer to this question: it isn’t. 

The significance of these discoveries in genomics to the debate about 
design has passed largely unnoticed in the media. But repeated attempts 
to stigmatize the ENCODE researchers as aiding and abetting 
“intelligent design creationists” have inadvertently highlighted what is at 
stake. In this effort, a biochemist at the University of Toronto, Laurence 
A. Moran, emerged as point man. The Moran strategy centered on 
tarring scientists and science journalists who publicized ENCODE and 
its implications with the brush of “Intelligent Design Creationism’—an 
all too familiar conflation of intelligent design with a very different idea, 
the biblical literalism of young-earth creationism. When the 
distinguished journal Science selected ENCODE as one of the top ten 
science news stories of 2012, reminding readers that it had detonated the 
notion of junk DNA by revealing overwhelming functionality in the 
genome,*° Moran jeered, “Oh well, I guess PI just have to be content to 
point out that many scientists are as stupid as many Intelligent Design 
Creationists! ”^7 In the science world, as in the media, “creationist” is a 
dirty word; it’s like calling someone a Communist used to be in the 
1950s. Such attempts to stigmatize results that challenge a favored 





theory illustrate how an ideological monopoly in science can stifle 
inquiry and discussion. 

The demise of the idea of junk DNA illustrates too, in a more positive 
way, how a competing perspective can inspire research that contributes 
to new discovery. Although clearly not every scientist who performed 
research helping to establish the functional significance of nonprotein- 
coding DNA was inspired by the theory of intelligent design, at least one 
noteworthy scientist was. During the early part of the decade, before 
ENCODE made the headlines, this scientist published many articles 
challenging the idea of junk DNA based on genomics research that he 
was conducting at the National Institutes of Health. After the publication 
of ENCODE in 2012, his coauthor on many of those articles, the 
prominent University of Chicago geneticist James Shapiro, wrote an 
article in the Huffington Post commending the scientist for his 
groundbreaking research and for anticipating the ENCODE results years 
before. In the article, Shapiro acknowledged that he and his coauthor had 
“different evolutionary philosophies”—his charitable way of referring to 
his coauthor’s growing interest in the theory of intelligent design. 

Who was that other scientist? None other than Richard Sternberg, the 
evolutionary biologist who was punished for his openness to intelligent 
design while serving at the Smithsonian Institution (and the National 
Institutes of Health) in 2004. Around that time, Sternberg’s doubts about 
neo-Darwinism and his growing interest in intelligent design led him to 
consider the possibility that the majority of the genome could really be 
functional.4* His research subsequently confirmed what was for him, an 
idea inspired in part by intelligent design. 

In Signature in the Cell, I described many other discriminating 
predictions of the theory of intelligent design—predictions that differ 
from those of competing materialistic evolutionary theories—and how 
those predictions might help to guide new research in various 
subdisciplines of biology, including some in medicine. These predictions 
may also lead scientists to make new discoveries—discoveries that 
proponents of a competing perspective might not have been inclined to 
make—or to accept. 


OPEN VISTAS 


By now it should be clear why so many brilliant scientists have missed 
the evidence of design in the Cambrian explosion. Scott Todd, a 
biologist writing in Nature, succinctly stated the reason: “Even if all the 
data point to an intelligent designer, such a hypothesis is excluded from 
science because it is not naturalistic.”*? When scientists decide by fiat 
that intelligent design lies beyond the bounds of science, their decision 
will prevent them from considering this possibly or probably true 
explanation for the origin of animal form. But it will also deprive them 
of a new perspective that can generate new research questions and foster 
new avenues of discovery. Knowing this helps solve the final mystery of 
this book, but it also suggests a more productive way forward to address 
mysteries yet unsolved. Scientists committed to methodological 
naturalism have nothing to lose but their chains—fetters that bind them 
to a creaky and exhausted nineteenth-century materialism. The future 
lies open before them, and us. As we in the intelligent design research 
community like to say, let’s break some rules and follow the evidence 
wherever it leads. 


20 
What’s at Stake 


In the summer of 2002, I had the opportunity to hike up to the Burgess 
Shale with a group of geologists, geophysicists, and marine biologists. 
Our group also included my then eleven-year-old son and a teenage 
friend of his who was interested in the Cambrian fossils and the debate 
about Darwinism and design. 

When we got to the top of the mountain, I was unprepared for the 
impact the fossils would have on me. I had seen many fossils before, of 
course. But seeing these fossils—marine animals from the dawn of 
animal life at the top of a mountain with their beautifully preserved 
appendages and organs—rendered the idea of the “Cambrian explosion” 
a good deal less theoretical for me than it had been. These complex sea 
creatures, now brushed by the thin air at an elevation of 7,500 feet in the 
middle of the Canadian Rockies, had apparently arisen suddenly, almost 
from nothing by way of ancestral forms, in the sedimentary record. 
Everything about them cried out for a story—a big story. It set my mind 
and imagination racing (see Fig. 20.1). 

As wonderful as the fossils were, our trip to see them was made 
more memorable by two things that happened en route—one on the way 
up the mountain and one on the way down. As we were making our 
ascent, crossing a large talus slope—a section of the mountain void of 
vegetation and covered with only fragments of sedimentary rock—I 
heard my son unexpectedly call out to me from up at the front of our 
group. His voice had a trembling quality. I looked forward to see him, 
normally a fearless kid blessed by energy without bound, standing 
locked in place, pale and wide-eyed. I stepped around several of the 


other hikers on the trail to catch up with him. It turned out he was 
experiencing a kind of vertigo, though the mountain was not 
dangerously steep at that point (see Fig. 20.2). As he set out across the 
path that cut through the rocky slope, he had made the mistake of 
looking down the mountain. Without trees as a reference point and with 
hundreds of feet of loose rock fragments above and beneath him, he 
became disoriented and frightened. I steadied him as we walked in step, 
stride for stride, with me directly behind him across that open stretch of 
the mountain. Before long we were back to a place on the trail where 
trees and other plants appeared, providing a steadying presence as a 
point of reference. My son’s perspective quickly returned. He relaxed 
and soon was smiling and leaping confidently ahead of me again. 

On the way down the mountain, I had a striking interaction with a 
member of our group, who gave voice to a different kind of 
disorientation. It began as a conversation between my son’s friend and 
our official field guide, who had been assigned to us by the local 
Burgess Shale Geoscience Foundation. Our guide was a paleontologist 
and did a terrific job. He told many a fine story about the geological 
history of the surrounding formations, about the discovery of the fossils, 
and of course about the evolutionary history of animal life. In fact, just 
before we turned the final corner on the trail to ascend to a large 
collection of excellent fossils available for viewing at the top of the 
mountain, he slipped a statement of support for evolutionary orthodoxy 
into his description of the fossil site. Our guide was clearly unaware that 
many of us in the group knew the fossils we were about to see 
challenged the standard Darwinian story. 
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FIGURE 20.1 
A trilobite fossil found at the Burgess Shale. Courtesy Michael 
Melford/National Geographic Image Collection/Getty Images. 





FIGURE 20.2 
Figure 20.a (left): Photograph of the author and his son Jamie at the 
Walcott Quarry of the Burgess Shale in British Columbia, Canada. 


Figure 20.2b (right): Pausing for a moment to reflect on the slope below 
the Burgess Shale outcrop. 


We hadn’t made an issue of our views, of course, but nearly all the 
scientists on the hike were skeptical of neo-Darwinism. Paul Chien, the 
University of San Francisco marine biologist who had worked with J. Y. 
Chen in China on the sponge embryo fossils, was on the trip, and he had 
more than a passing acquaintance with Cambrian-era paleontology—as 
did several of the Canadian geologists with us. Still, not wanting to 
introduce any needless discord, we carefully avoided engaging the 
issue. We just wanted to see the fossils. 

As we descended the mountain, however, my son’s young friend 
asked our guide how he squared what we had just seen with his support 
for Darwinian evolution. The guide at first maintained his commitment 
to the Darwinian party line. He said he thought that Darwin would feel 
“vindicated” by the discovery of the Burgess fossils. This proved too 
much for the precocious, intellectual teenager, who loudly blurted out, 
“What?! Darwin would feel vindicated? By the sudden appearance of all 
those animals without any ancestors in the fossil record! Are you 
kidding?!” 

You would have to know this endearing young man to understand 
how his uninhibited outburst only charmed and amused our guide. But 
fortunately it did. The rest of us, however, were initially mortified. This 
was the discussion we were trying to avoid, knowing exactly the intense 
emotions it often provokes. With scientists, it is generally safer to 
discuss religion and politics. 

Nevertheless, to his credit, our guide took the challenge in stride. He 
explained how the Burgess fossils demonstrated evidence of change 
over time, how the rock column showed the great age of the earth, and 
how the discovery of the fossils high on a mountain revealed the 
evolution of the planet. Our young friend had spent too much time 
reading up on the subject to let the point go at that. He brushed aside the 
issue of the earth’s age, which like our guide he reckoned in billions of 
years, and assured the man that he accepted evidence for change over 


time in the sedimentary record. He didn’t question evolution in that 
sense. He questioned Darwinian evolution. “Where is the evidence of 
gradual change?” he demanded, as his teenage voice cracked with 
excitement as it moved into the upper register. He continued, “What 
mechanism could produce so many new animals so quickly?” 

An odd thing happened then. The paleontologist now leading us 
down the trail suddenly ceased to act the part of “guide.” He dropped 
any pretense of superior authority and said, “You know, I’ve wondered 
about that myself.” I thought I could hear in his voice the candid 
amazement of the fourteen-year-old boy he once was. 

“How do you explain it?” he asked my son’s friend. 

Our young spokesman confidently piped up and asserted, “Intelligent 
design, of course!” 

At which point, our guide began to ask the probing questions. Soon 
my son’s friend had exhausted his store of knowledge and began to look 
to me to join the conversation. I did so, reluctantly at first. I explained 
the information argument for intelligent design and how the Cambrian 
explosion contributed to it. Our guide asked me the hard questions: How 
can we detect design? Is intelligent design science? Aren’t we just 
arguing from ignorance and giving up on science, or at least mainstream 
evolutionary science, too soon? He also wanted to know who I 
personally thought the designer was. His challenges were tough and 
honest. A terrific conversation ensued. 

When we reached the trailhead, he surprised me, thanking me for the 
conversation—and thanking my young friend for starting it. He then 
spoke a bit more personally and revealed that he sometimes found 
thinking about biological origins disturbing. He said that as a scientist 
he was committed to the evolutionary perspective. But he also found its 
denial of purpose depressing. He wondered if there was some way to 
affirm both science and the kind of purpose and meaning in life that 
religion speaks about. As we parted ways, he said he would like to learn 
more about intelligent design. He told me he was intrigued by the 
perspective we were developing. I felt that we had made a genuine 


human connection rather than, as sometimes happens in the evolution 
debate, merely flinging assertions at one another. 

Over the years, as I’ve researched and thought about biological 
origins, I have had numerous similar conversations with people of many 
persuasions and backgrounds: religious and nonreligious; scientists, 
engineers, medical doctors; businessmen and -women, appliance 
repairmen and taxi drivers. These conversations usually start innocently 
enough as the result of someone asking me what I do for a living. 
Though I often euphemize my response (“I work for a research 
organization”) to avoid getting trapped in a heavy conversation on an 
airplane or over a broken dishwasher, often the conversations come 
whether I want them or not. People are interested in how life began and 
they instinctively understand that whatever theory we adopt has larger 
philosophical, religious, or worldview implications. People are usually 
energized by considering those larger implications and questions. Many 
would like to find a way to harmonize the evidence from science with a 
view of the world that addresses their deepest existential longings as 
human beings, their yearning for purpose and significance. But like our 
guide, many have been frustrated by the difficulty of arriving at a 
coherent synthesis. 

It’s not hard to see why. On the one hand, many people of faith have 
little real interest in what science has to say about life’s origins. Indeed, 
many well-meaning religious believers have adopted a view of the 
relationship between science and faith that rejects the testimony of 
science as irrelevant or even dangerous and affirms that just reading the 
Bible will give all the insight needed to understand how life came to be. 
Their approach does not really attempt to harmonize faith and science, 
since it takes faith in the Bible, and often a particular interpretation of 
the Bible, as the only reliable source of information about life’s origin. 

On the other hand, many scientists and others who think that science 
has something to teach us about the big questions have started by 
assuming the neo-Darwinian account of biological origins, despite its 
many scientific difficulties—and despite its denial of any role for 
purposive intelligence in the history of life. 


In particular, two popular ideas about how Darwinism informs 
worldview have come to different conclusions about the worldview it 
affirms—or allows. The first view, the “New Atheism,” has been 
articulated by spokesmen such as Richard Dawkins in his book The God 
Delusion and the late Christopher Hitchens in God Is Not Great.' It 
purports to refute the existence of God as “a failed hypothesis,”” as 
another New Atheist book puts it. Why? Because, according to Dawkins 
and others, there is no evidence of design in nature. Indeed, Dawkins’s 
argument for atheism hinges upon his claim that natural selection and 
random mutation can explain away all “appearances” of design in 
nature. And since, he asserts, the design argument always provided the 
strongest argument for believing in God’s existence, belief in God, he 
concludes, is extremely improbable—tantamount to “a delusion.” For 
the New Atheists, Darwinism makes theistic belief both implausible and 
unnecessary. As Dawkins has famously put it, “Darwin made it possible 
to be an intellectually fulfilled atheist.”? 

The New Atheists took the publishing world by storm in 2006 when 
The God Delusion first appeared. But nothing about the “New” Atheism 
was actually “new.” Instead, it represents a popularization of a science- 
based philosophy, called scientific materialism, that came into currency 
among scientists and philosophers during the late nineteenth century in 
the wake of the Darwinian revolution. For many scientists and scholars 
at the time, a scientifically informed worldview was a materialistic 
worldview in which entities such as God, free will, mind, soul, and 
purpose played no role. Scientific materialism, following classical 
Darwinism, denied evidence of any design in nature and, therefore, any 
ultimate purpose to human existence. As British philosopher and 
mathematician Bertrand Russell put it early in the twentieth century, 
“Man is the product of causes which had no prevision of the end they 
were achieving” and which predestine him “to extinction in the vast 
death of the solar system.”* 

An alternative and increasingly popular view is known as theistic 
evolution. Popularized by Christian geneticist Francis Collins in his 
book The Language of God (also published in 2006),° this perspective 


affirms the existence of God and the Darwinian account of biological 
origins. Yet it provides few details about how God might or might not 
influence the evolutionary process, or how to reconcile seemingly 
contradictory claims in the Darwinian and Judeo-Christian accounts of 
origins. For example, Collins has declined to say whether he thinks God 
in any way directed or guided the evolutionary process, though he 
affirms neo-Darwinism, which specifically denies that natural selection 
is guided in any way. Darwinism and neo-Darwinism insist that the 
appearance of design in living organisms is an illusion because the 
mechanism that produces that appearance is unguided and undirected. 
Does God, in Collins’s view, guide the unguided process of natural 
selection? He, and many other theistic evolutionists, don’t say. This 
ambiguity has made an uneasy reconciliation of science and faith 
possible, but it has also left many questions unanswered. In fairness, 
many theistic evolutionists would argue that not all such questions can 
be answered, because science and faith occupy separate, non- 
overlapping realms of inquiry, knowledge, and experience. But that 
answer itself underscores the limits of the harmonization of science and 
faith that Collins and others holding his view has achieved. 

The argument of this book presents a scientific challenge to both of 
these views. In the first place, the evidence and arguments we have seen 
show that the scientific premise of the New Atheist argument is flawed. 
The mechanism of mutation and natural selection does not have the 
creative power attributed to it and, thus, cannot explain all 
“appearances” of design in life. The neo-Darwinian mechanism does not 
explain, for example, either the new genetic or epigenetic information 
necessary to produce fundamentally new animal body plans. 

This book has presented four separate scientific critiques 
demonstrating the inadequacy of the neo-Darwinian mechanism, the 
mechanism that Dawkins assumes can produce the appearance of design 
without intelligent guidance. It has shown that the neo-Darwinian 
mechanism fails to account for the origin of genetic information 
because: (1) it has no means of efficiently searching combinatorial 
sequence space for functional genes and proteins and, consequently, (2) 


it requires unrealistically Jong waiting times to generate even a single 
new gene or protein. It has also shown that the mechanism cannot 
produce new body plans because: (3) early acting mutations, the only 
kind capable of generating large-scale changes, are also invariably 
deleterious, and (4) genetic mutations cannot, in any case, generate the 
epigenetic information necessary to build a body plan. Thus, despite the 
commercial success of The God Delusion and its wide cultural currency, 
the New Atheist philosophy lacks credibility because it has based its 
understanding of the metaphysical implications of modern science on a 
scientific theory that itself lacks credibility—as even many leading 
evolutionary biologists now acknowledge.° 

Second, this book poses a strong challenge to theistic evolutionists 
such as Francis Collins for many of the same scientific reasons. Collins 
places great trust in modern Darwinism as the unifying theory of 
biology, but seems completely unaware of the formidable scientific 
problems now afflicting the theory—ain particular, the challenges to the 
creative power of the natural selection/mutation mechanism. He makes 
no attempt to address or answer any of these challenges. In addition, 
many of his arguments for universal common descent—the defense of 
which was his main concern in The Language of God—are based upon 
the alleged presence of nonfunctional or “junk” elements in the 
genomes of different organisms. Though the theory of intelligent design, 
which Collins says he opposes, does not necessarily challenge this part 
(common descent) of Darwinian theory, the factual basis of his 
arguments has now also largely evaporated as the result of ENCODE 
and other developments in genomics.’ Thus, this popular view of 
biological origins, and its conception of God’s relationship to the natural 
world, now stands starkly at odds with the evidence. But why attempt to 
reconcile traditional Christian theology with Darwinian theory, as 
Collins tries to do, if the theory itself has begun to collapse? 

The perspective of this book offers a potentially more coherent and 
satisfying way of addressing the big questions, of synthesizing science 
and metaphysics (or faith), than either of the currently popular views on 
offer. The Cambrian explosion, like evolutionary theory itself, raises 


larger worldview questions precisely because it raises questions of 
origins and of design, and with them, the question that all worldviews 
must address: What is the thing or the entity from which everything 
comes? But unlike strict Darwinian materialism and the New Atheism 
built atop it, the theory of intelligent design affirms the reality of a 
designer—a mind or personal intelligence behind life. This case for 
design restores to Western thought the possibility that human life in 
particular may have a purpose or significance beyond temporary 
material utility. It suggests the possibility that life may have been 
designed by an intelligent person, indeed, one that many would identify 
as God. 

Unlike the theistic evolution of Francis Collins, however, the theory 
of intelligent design does not seek to confine the activity of such an 
agency to the beginning of the universe, conveying the impression of a 
decidedly remote and impersonal deistic entity. Nor does the theory of 
intelligent design merely assert the existence of a creative intelligence 
behind life. It identifies and detects activity of the designer of life, and 
does so at different points in the history of life, including the explosive 
show of creativity on display in the Cambrian event. The ability to 
detect design makes belief in an intelligent designer (or a creator, or 
God) not only a tenet of faith, but something to which the evidence of 
nature now bears witness. In short, it brings science and faith into real 
harmony. 

Just as importantly, perhaps, the case for design supports us in our 
existential confrontation with the void and the seeming meaninglessness 
of physical existence—the sense of survival for survival’s sake that 
follows inexorably from the materialist worldview. Richard Dawkins 
and other New Atheists may find it untroubling, even amusing and 
certainly profitable, to muse over the prospect of a universe without 
purpose. But for the vast majority of thoughtful people, that idea is 
tinged with terror. Modern life suspends many of us, so we feel, high 
over a chasm of despair. It provokes feelings of dizzying anxiety—in a 
word, vertigo. The evidence of a purposeful design behind life, on the 
other hand, offers the prospect of significance, wholeness, and hope. 


As my son walked out across the mountain high above the Yoho 
Valley, he was surrounded by many slabs of rock containing some of the 
very fossils we had come to see. But as he surveyed that barren portion 
of landscape, he lost perspective on where he was and what he had 
come to do. Without landmarks or steadying points of reference, he felt 
as if he were lost in a sea of sensory impressions. Without his sense of 
balance, he feared even to take a step. He called out for his father. 

It occurred to me only much later how closely his experience 
parallels our own as human beings trying to make sense of the world 
around us. To gain a true picture of the world and our place in it we 
need facts—empirical data. But we also need perspective, sometimes 
called wisdom, the reference points that a coherent view of the world 
provides. Historically, that wisdom was provided for many men and 
women by the traditions of Western monotheism—by our belief in God. 
The theory of intelligent design generates both excitement and loathing 
because, in addition to providing a compelling explanation of the 
scientific facts, it holds out the promise of help in integrating two things 
of supreme importance—science and faith—that have long been seen as 
at odds.® 

The theory of intelligent design is not based upon religious belief, 
nor does it provide a proof for the existence of God. But it does have 
faith-affirming implications precisely because it suggests the design we 
observe in the natural world is real, just as a traditional theistic view of 
the world would lead us to expect. Of course, that by itself is not a 
reason to accept the theory. But having accepted it for other reasons, it 
may be a reason to find it important. 
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Plate 1. Mt. Maotian, near Chengjiang, China. The famous Chengjiang 
fossil site was discovered near the top of this mountain. Courtesy 
Illustra Media (all). 





Plate 2. The original discovery site of the first Chengjiang fossils. Today 
this area is a preserve. 





Plate 3. Looking down from Mt. Maotian, with the large Fuxian Lake in 
the distant foreground. 





Plate 4. The opabinid Jianfengia multisegmentalis. Courtesy Paul 
Chien. Unless otherwise noted, all fossils shown on this and subsequent 
color pages are from the Maotianshan Shale near Chengjiang, China. 
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Plate 5. A fossilized comb jelly (a ctenophore). Species: Maotianoascus 
octonarius. Courtesy J. Y. Chen. 





Plates 6 & 7. Priapulid worms of the species Maotianshania cylindrica. 
Courtesy Paul Chien and Illustra Media. 





Plate 8. A lobopodian “worm with legs” of the genus Microdictyon. 





Plate 9. Another lobopod worm. Species: Microdictyon sinicum. 





Plate 10. Luolishania longicruris, a rare lobopod of the Maotianshan 
Shale. 





Plate 11. The extinct and enigmatic Vetulicolia, thought to represent 
either an arthropod, a tunicate- like urochordate, or a distinct phylum. 
Courtesy Illustra Media (all). 





Plates 12 & 13. Two specimens of the arthropod Leanchoilia. The top 
specimen shows its distinctive long frontal whiplike appendages. 





Plate 14. Chengjiangocaris longiformis (below), a rare arthropod known 
only from the Chengjiang biota. Courtesy Illustra Media (all). 

Two fossils from the dark-colored Burgess Shale, near the town of Field 
in British Columbia, Canada. Note the contrast in color to the fossils 
from the honey-colored Maotianshan Shale depicted on the previous and 
subsequent pages. 





Plate 15. A fossil of the arthropod Marrella from the Burgess Shale. 


Courtesy Illustra Media. 





Plate 16 (below). The shrimp-like arthropod Waptia. Courtesy Illustra 
Media. 
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Plate 17. The arthropod Fuxianhuia protensa. Note its exquisitely 
preserved eyes, antennae, segmented body, and forked tail. Courtesy 
Illustra Media. 





Plate 18. A fossil of the arthropod Waptia, from the honey- colored 
Maotianshan Shale. Courtesy Paul Chien. 





Plate 19. A fossil of an Anomalocaris, whose large eyes and jaws made 
it the fiercest predator in the Cambrian seas. Some reached a length of 
one meter. Courtesy J. Y. Chen. 





Plates 20-22. A montage of beautifully preserved trilo-bites from the 
Maotianshan Shale, including a specimen of Eoredlichia intermedia 
(left). Courtesy Illustra Media (all). 











Plate 23. A specimen of the trilobite Kuanyangia pustulosa from the 
Maotianshan Shale near Chengjiang, China. Courtesy Illustra Media. 
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